PyTorch Issue 自动分类技能：GitHub Issue 智能路由与标签管理工具

triaging-issues by pytorch/pytorch

173 周安装量

98,500 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/pytorch/pytorch --skill triaging-issues

自动化开发运维开源项目

🇨🇳中文介绍

包含钩子

此技能使用 Claude 钩子，可以自动响应事件执行代码。安装前请仔细审查。

PyTorch Issue Triage Skill

此技能通过路由问题、应用标签和留下首轮回复，帮助对 GitHub issue 进行分类处理。

可用的 MCP 工具
绝对禁止添加的标签
Issue 分类处理步骤
- 步骤 0：已路由 — 跳过
- 步骤 1：问题与 Bug/功能请求的区分
- 步骤 1.5：需要复现 — 外部文件
- 步骤 2：转移
- 步骤 2.5：PT2 问题 — 特殊处理
- 步骤 3：重定向至次级 Oncall
- 步骤 4：标记 Issue
- 步骤 5：高优先级 — 需要人工审核
- 步骤 6：bot-triaged（自动）
- 步骤 7：标记为已分类
V1 限制

标签参考： 查看 labels.json 获取适用于分类处理的完整 305 个标签目录。仅应用此文件中存在的标签。 不要发明或猜测标签名称。此文件排除了 CI 触发器、测试配置、发布说明和已弃用的标签。

PT2 分类指南： 查看 pt2-triage-rubric.md 获取处理 PT2/torch.compile 问题时详细的标签应用指南。

回复模板： 查看 templates.json 获取标准回复消息。

可用的 MCP 工具

使用以下 GitHub MCP 工具进行分类处理：

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

相关 Skills

FlyClaw：零登录航班聚合查询工具，Python实现多源航班信息与价格搜索

4,000,000 周安装

Vercel React 最佳实践指南 | 58条Next.js性能优化规则与代码重构

278,000 周安装

agent-browser 浏览器自动化工具 - Vercel Labs 命令行网页操作与测试

152,900 周安装

Azure Data Explorer (Kusto) 查询技能：KQL数据分析、日志遥测与时间序列处理

125,100 周安装

工具	用途
`mcp__github__issue_read`	获取 issue 详情、评论和现有标签
`mcp__github__issue_write`	应用标签或关闭 issue
`mcp__github__add_issue_comment`	添加评论（仅用于重定向问题）
`mcp__github__search_issues`	查找类似 issue 以获取上下文

前缀/类别	原因
不在 `labels.json` 中的标签	仅应用白名单中存在的标签
`ciflow/*`	仅用于 PR 的 CI 作业触发器
`test-config/*`	仅用于 PR 的测试套件选择器
`release notes: *`	为发布说明自动分配
`ci-`, `ci:`	CI 基础设施控制
`sev*`	严重性标签需要人工决定
`merge blocking`	需要人工决定
任何包含 "deprecated" 的标签	已过时
`oncall: releng`	不是分类重定向目标。请改用 `module: ci`

文件附件：.zip、.pt、.pth、.pkl、.safetensors、.onnx、.bin 文件
外部存储：Google Drive、Dropbox、OneDrive、Mega、WeTransfer 链接
模型中心：指向模型文件的 Hugging Face Hub 链接

编辑 issue 正文 以移除/编辑下载链接
- 替换为：[链接已移除 - 出于安全原因，不允许外部文件下载]
添加 needs reproduction 标签
使用 templates.json 中的 needs_reproduction 模板请求一个自包含的复现示例
不要添加 triaged — 等待用户提供可复现的示例

在 import torch 时出现的 undefined symbol: ncclAlltoAll 错误是打包问题（module: binaries），而不是分布式训练 bug — 用户从未运行分布式代码。
参数名或容差检查中的 nan 不是 module: NaNs and Infs，除非 bug 实际上是关于 NaN 传播。
提到 autograd 的堆栈跟踪并不意味着 module: autograd — 检查 bug 是在 autograd 本身还是仅仅在调用路径上。
带有容差阈值的测试失败是 module: tests，而不是 module: numerical-stability。

添加 module: edge cases 标签
如果来自模糊测试，也添加 topic: fuzzer
使用 templates.json 中的 numerical_accuracy 模板链接到文档
如果根据文档该问题明显是预期行为，则使用模板评论关闭它

PT2 不是重定向。 oncall: pt2 与步骤 3 中的其他 oncall 标签不同。PT2 问题继续执行步骤 4–7 以完成分类 — 添加 oncall: pt2，然后继续使用 module: 标签进行标记，标记为 triaged 等。

标签	何时使用
`oncall: jit`	TorchScript 问题
`oncall: distributed`	分布式训练（DDP、FSDP、RPC、c10d、DTensor、DeviceMesh、对称内存、上下文并行、流水线）
`oncall: export`	torch.export 问题
`oncall: quantization`	量化问题
`oncall: mobile`	移动端（iOS/Android），不包括 ExecuTorch
`oncall: profiler`	性能分析器问题（CPU、GPU、Kineto）
`oncall: visualization`	TensorBoard 集成

MPS ≠ 移动端。 MPS（Metal Performance Shaders）是 macOS/Apple Silicon GPU 后端。不要将 MPS 问题路由到 oncall: mobile。MPS 问题保留在通用队列中，使用 module: mps。
DTensor → oncall: distributed。 DTensor 问题应始终路由到 oncall: distributed，即使它们没有提到 DDP/FSDP。
ONNX → module: onnx。 没有 oncall: onnx。使用 module: onnx 并保留在通用队列中。
CI/releng → module: ci。 不要使用 oncall: releng。对于 CI 基础设施问题，使用 module: ci。
torch.compile + 分布式。 当 torch.compile 错误处理分布式操作（例如 dist.all_reduce）时，该问题通常需要同时添加 oncall: pt2 和 oncall: distributed，因为修复可能涉及两个代码库。

根据受影响区域添加 1 个或多个 module: ... 标签
当特定标签和通用标签同时存在时，优先使用特定标签。查看 labels.json 中的描述，了解特定标签何时取代通用标签的指导（例如，对于 SDPA 问题，使用 module: sdpa 而不是 module: nn；对于灵活注意力问题，使用 module: flex attention 而不是 module: nn）。
feature — 全新的功能，目前以任何形式都不存在
enhancement — 对已有效工作的内容的改进（例如，为已通过回退/组合方式运行的操作添加原生后端内核、性能优化、更好的错误消息）。如果增强是关于性能的，也添加 module: performance。
function request — 新函数或现有函数的新参数/模式
如果 issue 说操作“目前有效”或“回退到”较慢的路径，那是 enhancement，而不是 feature

条件	标签
段错误、非法内存访问、SIGSEGV	`module: crash`
性能问题：回归、速度变慢或优化请求	`module: performance`
Windows 上的问题	`module: windows`
先前有效的功能现在损坏	`module: regression`
先前有效的损坏文档/链接	`module: docs` + `module: regression`（不是 `enhancement`）
关于测试失败的问题（不是底层功能）	`module: tests`
反向传播/梯度计算 bug	`module: autograd`（除了操作的模块标签外）
`torch.linalg` 操作或线性代数操作（solve、svd、eig、inv 等）	`module: linear algebra`
`has workaround`	仅当变通方法是非平凡且非显而易见时才添加。如果问题是“X 对非连续张量无效”，调用 `.contiguous()` 是 bug 的同义反复逆，而不是变通方法。真正的变通方法是诸如安装特定包版本、添加同步点、插入 `gc.collect()` 或使用 bug 描述中未明确暗示的不同 API。

关闭明确的使用问题并指向 discuss.pytorch.org（根据步骤 1）
保持保守 — 如有疑问，添加 triage review 以引起人工注意
在确信时应用类型标签（feature、enhancement、function request）
分类完成时添加 triaged 标签

🇺🇸English

Contains Hooks

This skill uses Claude hooks which can execute code automatically in response to events. Review carefully before installing.

PyTorch Issue Triage Skill

This skill helps triage GitHub issues by routing issues, applying labels, and leaving first-line responses.

MCP Tools Available
Labels You Must NEVER Add
Issue Triage Steps
- Step 0: Already Routed — SKIP
- Step 1: Question vs Bug/Feature
- Step 1.5: Needs Reproduction — External Files
- Step 2: Transfer
- Step 2.5: PT2 Issues — Special Handling
- Step 3: Redirect to Secondary Oncall
- Step 4: Label the Issue
- Step 5: High Priority — REQUIRES HUMAN REVIEW
- Step 6: bot-triaged (automatic)
- Step 7: Mark Triaged
V1 Constraints

Labels reference: See labels.json for the full catalog of 305 labels suitable for triage. ONLY apply labels that exist in this file. Do not invent or guess label names. This file excludes CI triggers, test configs, release notes, and deprecated labels.

PT2 triage guide: See pt2-triage-rubric.md for detailed labeling guidance when triaging PT2/torch.compile issues.

Response templates: See templates.json for standard response messages.

MCP Tools Available

Use these GitHub MCP tools for triage:

Tool	Purpose
`mcp__github__issue_read`	Get issue details, comments, and existing labels
`mcp__github__issue_write`	Apply labels or close issues
`mcp__github__add_issue_comment`	Add comment (only for redirecting questions)
`mcp__github__search_issues`	Find similar issues for context

Labels You Must NEVER Add

Prefix/Category	Reason
Labels not in `labels.json`	Only apply labels that exist in the allowlist
`ciflow/*`	CI job triggers for PRs only
`test-config/*`	Test suite selectors for PRs only
`release notes: *`	Auto-assigned for release notes
`ci-`, `ci:`	CI infrastructure controls
`sev*`

If blocked: When a label is blocked by the hook, add ONLY triage review and stop. A human will handle it.

These rules are enforced by a PreToolUse hook that validates all labels against labels.json.

Never Override Human Labels

If a human has already applied labels (especially ci: sev, severity labels, or priority labels), do NOT remove or replace them. Your job is to supplement, not override.

Issue Triage (for each issue)

0) Already Routed — SKIP

If an issue already has ANYoncall: label, SKIP IT entirely. Do not:

Add any labels
Add triaged
Leave comments
Do any triage work

That issue belongs to the sub-oncall team. They own their queue.

1) Question vs Bug/Feature

If it is a question (not a bug report or feature request): close and use the redirect_to_forum template from templates.json.
If unclear whether it is a bug/feature vs a question: request additional information using the request_more_info template and stop.

1.5) Needs Reproduction — External Files

Check if the issue body contains links to external files that users would need to download to reproduce.

Patterns to detect:

File attachments: .zip, .pt, .pth, .pkl, .safetensors, .onnx, .bin files
External storage: Google Drive, Dropbox, OneDrive, Mega, WeTransfer links
Model hubs: Hugging Face Hub links to model files

Action:

Edit the issue body to remove/redact the download links
- Replace with: [Link removed - external file downloads are not permitted for security reasons]
Add needs reproduction label
Use the needs_reproduction template from templates.json to request a self-contained reproduction
Do NOT add triaged — wait for the user to provide a reproducible example

1.55) Needs Reproduction — Other Cases

Also add needs reproduction when:

The user reports a hardware-specific issue (e.g., specific GPU model) without a self-contained repro script
The user references a specific model/checkpoint/dataset that is not publicly runnable in a few lines
The issue describes version-upgrade breakage but only provides a high-level description without a minimal script
The repro depends on a specific training setup, distributed environment, or non-trivial infrastructure

1.6) Edge Cases & Numerical Accuracy

If the issue involves extremal values or numerical precision differences:

Patterns to detect:

Values near torch.finfo(dtype).max or torch.finfo(dtype).min
NaN/Inf appearing in outputs from valid (but extreme) inputs
Differences between CPU and GPU results
Precision differences between dtypes (e.g., fp32 vs fp16)
Fuzzer-generated edge cases

IMPORTANT — avoid keyword-triggered mislabeling:

Label based on the root cause , not keywords that appear in the error or title. A keyword tells you what failed, not why.

An undefined symbol: ncclAlltoAll error at import torch is a packaging issue (module: binaries), not a distributed training bug — the user never ran distributed code.
A nan in a parameter name or tolerance check is not module: NaNs and Infs unless the bug is actually about NaN propagation.
A stack trace mentioning autograd does not mean module: autograd — check whether the bug is in autograd itself or just on the call path.
A test failure with tolerance thresholds is module: tests, not module: numerical-stability.

Ask: "Where would the fix need to be made?" That determines the label.

Action:

Add module: edge cases label
If from a fuzzer, also add topic: fuzzer
Use the numerical_accuracy template from templates.json to link to the docs
If the issue is clearly expected behavior per the docs, close it with the template comment

2) Transfer (domain library or ExecuTorch)

If the issue belongs in another repo (vision/text/audio/RL/ExecuTorch/etc.), transfer the issue and STOP.

2.5) PT2 Issues — Special Handling

PT2 is NOT a redirect. oncall: pt2 is not like the other oncall labels in Step 3. PT2 issues continue through Steps 4–7 for full triage — add oncall: pt2, then proceed to label with module: labels, mark triaged, etc.

See pt2-triage-rubric.md for detailed labeling decisions on which module: labels to apply.

3) Redirect to Secondary Oncall

CRITICAL: When redirecting issues to a non-PT2 oncall queue, apply exactly one oncall: ... label and STOP. Do NOT:

Add any module: labels
Mark it triaged
Do any further triage work

The sub-oncall team will handle their own triage. Your job is only to route it to them.

Oncall Redirect Labels

Label	When to use
`oncall: jit`	TorchScript issues
`oncall: distributed`	Distributed training (DDP, FSDP, RPC, c10d, DTensor, DeviceMesh, symmetric memory, context parallel, pipelining)
`oncall: export`	torch.export issues
`oncall: quantization`	Quantization issues
`oncall: mobile`	Mobile (iOS/Android), excludes ExecuTorch
`oncall: profiler`

Common routing mistakes to avoid:

MPS ≠ Mobile. MPS (Metal Performance Shaders) is the macOS/Apple Silicon GPU backend. Do NOT route MPS issues to oncall: mobile. MPS issues stay in the general queue with module: mps.
DTensor →oncall: distributed. DTensor issues should always be routed to oncall: distributed, even if they don't mention DDP/FSDP.
ONNX →module: onnx. There is no oncall: onnx. Use module: onnx and keep in the general queue.
CI/releng →module: ci. Do not use oncall: releng. Use for CI infrastructure issues.

Note: oncall: cpu inductor is a sub-queue of PT2. For general triage, just use oncall: pt2.

4) Label the issue (if NOT transferred/redirected)

Only if the issue stays in the general queue:

Add 1+ module: ... labels based on the affected area
Prefer specific labels over general ones when both exist. Check labels.json descriptions for guidance on when a specific label supersedes a general one (e.g., module: sdpa instead of module: nn for SDPA issues, module: flex attention instead of module: nn for flex attention).
feature — wholly new functionality that does not exist today in any form
enhancement — improvement to something that already works (e.g., adding a native backend kernel for an op that already runs via fallback/composite, performance optimization, better error messages). If the enhancement is about performance, also add module: performance.

Commonly missed labels — always check for these:

Condition	Label
Segfault, illegal memory access, SIGSEGV	`module: crash`
Performance issue: regression, slowdown, or optimization request	`module: performance`
Issue on Windows	`module: windows`
Previously working feature now broken	`module: regression`
Broken docs/links that previously worked	`module: docs` + `module: regression` (NOT `enhancement`)

Label based on the actual bug, not keywords. Read the issue to understand what is actually broken. A bug about broadcasting that happens to mention "nan" in a parameter name is a frontend bug, not a NaN/Inf bug.

5) High Priority — REQUIRES HUMAN REVIEW

CRITICAL: If you believe an issue is high priority, you MUST:

Add triage review label and do not add triaged

Do NOT directly add high priority without human confirmation.

High priority criteria:

Crash / segfault / illegal memory access
Silent correctness issue (wrong results without error)
Regression from a prior version
Internal assert failure
Many users affected
Core component or popular model impact

6) bot-triaged (automatic)

The bot-triaged label is automatically applied by a post-hook after any issue mutation. You do not need to add it manually.

7) Mark triaged

If not transferred/redirected and not flagged for review, add triaged.

V1 Constraints

DO NOT:

Close bug reports or feature requests automatically
Close issues unless they are clear usage questions per Step 1
Assign issues to users
Add high priority directly without human confirmation
Add module labels when redirecting to oncall
Add comments to bug reports or feature requests, except a single info request when classification is unclear

DO:

Close clear usage questions and point to discuss.pytorch.org (per step 1)
Be conservative - when in doubt, add triage review for human attention
Apply type labels (feature, enhancement, function request) when confident
Add triaged label when classification is complete

Note: bot-triaged is automatically applied by a post-hook after any issue mutation.

Weekly Installs

173

Repository

pytorch/pytorch

GitHub Stars

98.5K

First Seen

Jan 29, 2026

Security Audits

Gen Agent Trust HubPass SocketPass SnykWarn

Installed on

opencode169

gemini-cli168

codex167

cursor166

github-copilot165

claude-code163

PyTorch Issue 自动分类技能：GitHub Issue 智能路由与标签管理工具

🇨🇳中文介绍

PyTorch Issue Triage Skill

目录

可用的 MCP 工具

相关 Skills

绝对禁止添加的标签

切勿覆盖人工添加的标签

Issue 分类处理（针对每个 issue）

0) 已路由 — 跳过

1) 问题与 Bug/功能请求的区分

1.5) 需要复现 — 外部文件

1.55) 需要复现 — 其他情况

1.6) 边缘情况与数值精度

2) 转移（领域库或 ExecuTorch）

2.5) PT2 问题 — 特殊处理

3) 重定向至次级 Oncall

Oncall 重定向标签

4) 标记 issue（如果未转移/重定向）

5) 高优先级 — 需要人工审核

6) bot-triaged（自动）

7) 标记为已分类

V1 限制

🇺🇸English

PyTorch Issue Triage Skill

Contents

MCP Tools Available

Labels You Must NEVER Add

Never Override Human Labels

Issue Triage (for each issue)

0) Already Routed — SKIP

1) Question vs Bug/Feature

1.5) Needs Reproduction — External Files

1.55) Needs Reproduction — Other Cases

1.6) Edge Cases & Numerical Accuracy

2) Transfer (domain library or ExecuTorch)

2.5) PT2 Issues — Special Handling

3) Redirect to Secondary Oncall

Oncall Redirect Labels

4) Label the issue (if NOT transferred/redirected)

5) High Priority — REQUIRES HUMAN REVIEW

6) bot-triaged (automatic)

7) Mark triaged

V1 Constraints

最新 Skills