Transform complex, hard-to-understand Python code into clear, well-documented, maintainable code. This skill guides systematic refactoring that prioritizes human comprehension without sacrificing correctness or reasonable performance.
When to Invoke
Invoke this skill when:
User explicitly requests "human", "readable", "maintainable", "clean", or "refactor" code improvements
Code review processes flag comprehension or maintainability issues
Working with legacy code that needs modernization
Preparing code for team onboarding or educational contexts
Functions or modules are difficult to understand or modify
RED FLAG indicators: file >500 lines with scattered functions and global state, multiple global statements, no clear module/class organization, configuration mixed with business logic
Do NOT invoke this skill when:
Code is performance-critical and profiling shows optimization is needed first
User explicitly requests performance optimization over readability
Core Principles
Follow these principles in priority order:
Prefer structured OOP for complex code - Code with shared state, multiple concerns, or scattered global functions should be restructured into well-organized classes and modules. Script-like code with global state and tangled dependencies benefits most from OOP. However, simple modules with pure functions, CLI tools using click/argparse, and functional data pipelines don't need to be forced into classes.
Progressive disclosure - Reveal complexity in layers, not all at once
Reasonable performance - Never sacrifice >2x performance without explicit approval
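As a minimal sketch of the first principle, a script-like module with global state can be restructured into a class that owns that state. The `Inventory` name and fields here are hypothetical, not taken from any real codebase:

```python
# Before: script-like code with a module-level dict (hypothetical)
# stock = {}
# def add_item(name, qty): stock[name] = stock.get(name, 0) + qty

# After: the shared state is encapsulated in a class with one responsibility
class Inventory:
    """Tracks item quantities; replaces a module-level `stock` dict."""

    def __init__(self) -> None:
        self._stock: dict[str, int] = {}

    def add_item(self, name: str, qty: int) -> None:
        self._stock[name] = self._stock.get(name, 0) + qty

    def quantity(self, name: str) -> int:
        return self._stock.get(name, 0)
```

The class can now be instantiated per test, whereas the global dict leaked state between callers and tests.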
Key Constraints
ALWAYS observe these constraints:
SAFETY BY DESIGN - Use mandatory migration checklists for destructive changes. Create new structure, search all usages, migrate all, verify, only then remove old code. NEVER remove code before 100% migration verified.
STATIC ANALYSIS FIRST - Run flake8 --select=F821,E999 before tests to catch NameErrors immediately (E0602 is a pylint code, not a flake8 one)
PRESERVE BEHAVIOR - All existing tests must pass after refactoring
NO PERFORMANCE REGRESSION - Never degrade performance >2x without explicit user approval
NO API CHANGES - Public APIs remain unchanged unless explicitly requested and documented
After each micro-change (not at the end, EVERY SINGLE ONE):
flake8 --select=F821,E999 -> 0 errors
pytest -x -> all passing
Spot check 1 edge case for unchanged behavior
If ANY check fails: STOP -> REVERT -> ANALYZE -> FIX APPROACH -> RETRY
ANY REGRESSION = TOTAL FAILURE OF THE REFACTORING
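The check-after-every-micro-change loop above can be sketched as a small helper. The command lists mirror the checks in this section; the injectable `runner` parameter is an assumption added purely so the loop itself is testable:

```python
import subprocess

CHECKS = [
    ["flake8", "--select=F821,E999"],  # static analysis first: catch NameErrors
    ["pytest", "-x"],                  # then tests: -x stops at the first failure
]

def run_checks(checks=CHECKS, runner=None):
    """Return True only if every check passes; stop at the first failure."""
    if runner is None:
        def runner(cmd):
            return subprocess.run(cmd).returncode
    for cmd in checks:
        if runner(cmd) != 0:
            return False  # STOP -> revert -> analyze before retrying
    return True
```

Ordering the commands this way means a NameError introduced by an incomplete migration surfaces in seconds, before the slower test suite runs.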
Refactoring Workflow
Execute refactoring in four phases with validation at each step.
Phase 1: Analysis
Before making any changes, analyze the code comprehensively:
Read the entire codebase section being refactored to understand context
Identify readability issues using the anti-patterns reference (see references/anti-patterns.md):
Check for script-like/procedural code (global state, scattered functions, no clear structure)
Check for God Objects/Classes (classes doing too much)
Complex nested conditionals, long functions, magic numbers, cryptic names, etc.
Assess architecture (see references/oop_principles.md):
Is code organized in proper classes and modules?
Is there global state that should be encapsulated?
Are responsibilities properly separated?
Are SOLID principles followed?
Is dependency injection used instead of hard-coded dependencies?
Measure current metrics using scripts/measure_complexity.py or scripts/analyze_multi_metrics.py
Run linting analysis (see Tooling Recommendations below for which tool to use)
Check test coverage - Identify gaps that need filling before refactoring
Document findings using the analysis template (see assets/templates/analysis_template.md)
Output: Prioritized list of issues by impact and risk.
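One of the architecture checks above, dependency injection instead of hard-coded dependencies, can be illustrated with a hypothetical `ReportService` (all names here are invented for the sketch):

```python
# Hard-coded dependency: hides coupling and blocks testing
# class ReportService:
#     def __init__(self):
#         self.store = PostgresStore()  # cannot swap in a fake

# Injected dependency: the caller decides which store implementation to use
class ReportService:
    def __init__(self, store):
        self.store = store

    def total(self, key: str) -> int:
        return sum(self.store.values(key))

class FakeStore:
    """Test double satisfying the implicit `values(key)` protocol."""

    def values(self, key):
        return [1, 2, 3]
```

With injection, the analysis phase can flag the coupling and the test strategy can use `FakeStore` without touching a database.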
Phase 2: Planning
Plan the refactoring approach systematically with safety by design:
Identify changes by type:
Non-destructive: Renames, documentation, type hints -> Low risk
Destructive: Removing globals, deleting functions, replacing APIs -> High risk
For DESTRUCTIVE changes - CREATE MIGRATION PLAN (MANDATORY):
Search for ALL usages of each element to be removed
Document every found usage with file, line number, and usage type
If you cannot create a complete migration plan, you CANNOT proceed with the destructive change
Risk assessment for each proposed change (Low/Medium/High)
Dependency identification - What else depends on this code?
Test strategy - What tests are needed? What might break?
Change ordering - Sequence changes from safest to riskiest
Expected outcomes - Document what metrics should improve and by how much
Output: Refactoring plan with sequenced changes, migration plans for destructive changes, test strategy, and rollback plan.
Phase 3: Execution
Apply refactoring patterns using safety-by-design workflow.
For NON-DESTRUCTIVE changes (safe to do anytime):
Rename variables/functions for clarity
Extract magic numbers/strings to named constants
Add/improve documentation and type hints
Add guard clauses to reduce nesting
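A guard-clause rewrite of a nested conditional might look like this (the `process_order` example is hypothetical):

```python
# Before: three levels of nesting bury the happy path
def process_order_nested(order):
    if order is not None:
        if order.get("items"):
            if not order.get("cancelled"):
                return sum(order["items"])
    return 0

# After: guard clauses return early, leaving the happy path flat
def process_order(order):
    if order is None:
        return 0
    if not order.get("items"):
        return 0
    if order.get("cancelled"):
        return 0
    return sum(order["items"])
```

Both versions behave identically, which is exactly what makes this a non-destructive change.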
For DESTRUCTIVE changes (removing/replacing code) - STRICT PROTOCOL:
CREATE new structure (no removal yet) - write new classes/functions, add tests
SEARCH comprehensively for ALL usages of the element being removed
CREATE migration checklist documenting every found usage
MIGRATE one usage at a time, checking off the list, running static analysis + tests after each
VERIFY complete migration - re-run original searches, should find zero old references
REMOVE old code only after 100% migration verified
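During steps 1 through 5, old and new code coexist. One common bridge, sketched below with hypothetical names, is to keep the old entry point as a thin delegating wrapper until every usage on the checklist has been migrated:

```python
import warnings

class PriceCalculator:
    """New structure (step 1): created before anything is removed."""

    def __init__(self, tax_rate: float) -> None:
        self.tax_rate = tax_rate

    def total(self, amount: float) -> float:
        return amount * (1 + self.tax_rate)

def calc_total(amount: float, tax_rate: float = 0.2) -> float:
    """Old API, kept until step 6; delegates so behavior stays identical."""
    warnings.warn("calc_total is deprecated; use PriceCalculator",
                  DeprecationWarning, stacklevel=2)
    return PriceCalculator(tax_rate).total(amount)
```

Because the wrapper delegates, static analysis and tests stay green after each migrated usage, and the final removal in step 6 touches only dead code.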
Execution Rules
NEVER skip the migration checklist for destructive changes
Run static analysis BEFORE tests - Catch NameErrors immediately
One pattern at a time - Never mix multiple refactoring patterns in one change
Atomic commits - Each migration step gets its own commit
Stop on ANY error - Static analysis errors OR test failures require immediate fix/revert
Refactoring order (recommended sequence):
Transform script-like code to proper architecture (if code has global state and scattered functions). See references/examples/script_to_oop_transformation.md
Rename variables/functions for clarity
Extract magic numbers/strings to named constants (as class constants or enums)
Add/improve documentation and type hints
Extract methods to reduce function length
Simplify conditionals with guard clauses
Reduce nesting depth
Final review: Ensure separation of concerns is clean
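Step 3 of the sequence above (magic numbers to class constants or enums) in miniature, with hypothetical names:

```python
from enum import IntEnum

class Status(IntEnum):
    ACTIVE = 1
    SUSPENDED = 2

class Account:
    MAX_LOGIN_ATTEMPTS = 3  # was a bare `3` scattered through the code

    def __init__(self) -> None:
        self.status = Status.ACTIVE
        self.failed_logins = 0

    def record_failure(self) -> None:
        self.failed_logins += 1
        if self.failed_logins >= self.MAX_LOGIN_ATTEMPTS:
            self.status = Status.SUSPENDED
```

The threshold now has one authoritative definition, and the enum gives status comparisons a readable name instead of an integer.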
Output: Refactored code passing all tests with clear commit history.
Phase 4: Validation
Validate improvements objectively:
Run static analysis FIRST (catch errors before tests), then the full test suite, then re-measure the Phase 1 metrics and compare against the baseline
Verify documentation standards:
Module Documentation - Purpose and key dependencies
Inline Comments - Only for non-obvious "why"
Type Hints - All public APIs and complex internals
OOP Transformation Patterns
For transforming script-like code to structured OOP. See references/examples/script_to_oop_transformation.md for a complete guide and references/oop_principles.md for SOLID principles.
Anti-Patterns to Fix
See references/anti-patterns.md for the full catalog. Priority order:
Critical: Script-like/procedural code with global state, God Object/God Class
High: Complex nested conditionals (>3 levels), long functions (>30 lines), magic numbers, cryptic names, missing type hints, missing docstrings
Medium: Duplicate code, primitive obsession, long parameter lists (>5)
Low: Inconsistent naming, redundant comments, unused imports
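Two of the medium-priority items, primitive obsession and long parameter lists, often share one fix: group related primitives into a small dataclass. The names below are hypothetical:

```python
from dataclasses import dataclass

# Before: six loose parameters, four of which always travel together
# def ship(name, street, city, zip_code, country, express): ...

@dataclass(frozen=True)
class Address:
    street: str
    city: str
    zip_code: str
    country: str

def ship(name: str, address: Address, express: bool = False) -> str:
    speed = "express" if express else "standard"
    return f"{speed} shipment for {name} to {address.city}, {address.country}"
```

The parameter list shrinks below the threshold, and the address gains a type that can later carry validation.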
Tooling Recommendations
Primary Stack: Ruff + Complexipy (recommended for new projects)
pip install ruff complexipy radon wily
ruff check src/ # Fast linting (Rust, replaces flake8+plugins)
complexipy src/ --max-complexity-allowed 15 # Cognitive complexity (Rust)
radon mi src/ -s # Maintainability Index
See references/cognitive_complexity_guide.md for complete configuration (pyproject.toml, pre-commit hooks, GitHub Actions, CLI usage).
Alternative: Flake8 (for projects already using it)
The scripts/analyze_with_flake8.py and scripts/compare_flake8_reports.py scripts use flake8. See references/flake8_plugins_guide.md for the curated plugin list.
Multi-Metric Analysis
Use scripts/analyze_multi_metrics.py to combine cognitive complexity (complexipy), cyclomatic complexity (radon), and maintainability index in a single report.
| Metric | Tool | Use |
| --- | --- | --- |
| Cognitive Complexity | complexipy | Human comprehension |
| Cyclomatic Complexity | ruff (C901), radon | Test planning |
| Maintainability Index | radon | Overall code health |
Metric Targets
Cyclomatic complexity: <10 per function (warning at 15, error at 20)
Cognitive complexity: <15 per function (SonarQube default, warning at 20)
Function length: <30 lines (warning at 50)
Nesting depth: <=3 levels
Docstring coverage: >80% for public functions
Type hint coverage: >90% for public APIs
Historical Tracking with Wily
Monitor trends over time, not just thresholds. See references/cognitive_complexity_guide.md for setup and CI integration.
Common Refactoring Mistakes
See references/REGRESSION_PREVENTION.md for the full guide. Key traps:
Incomplete Migration - Removing old code before ALL usages are migrated (causes NameErrors)
Partial Pattern Application - Applying refactoring to some functions but not others
Breaking Public APIs - Changing function signatures used by external code
Output Format
Structure refactoring output using the template from assets/templates/summary_template.md. Include:
Changes made with rationale and risk level
Before/after metrics comparison table
Test results and performance impact
Risk assessment and human review recommendation
Related tools -- when to use what
humanize (agent, humanize plugin) -- Multi-language cosmetic cleanup. Renames local variables, improves comments, simplifies structure. Lowest regression risk. Use for: "make this readable", "clean up naming".
python-refactor (this skill) -- Python-only deep restructuring. OOP transformation, SOLID principles, complexity metrics, migration checklists, benchmark validation. Use for: "refactor this module", "reduce complexity", "transform to OOP".
Escalation path: humanize -> python-refactor (from safest to most thorough).
Integration with Same-Package Skills
python-tdd - Set up tests before refactoring, validate coverage after
python-performance-optimization - Deep profiling before/after refactoring
python-packaging - If refactoring a library, handle pyproject.toml and distribution
uv-package-manager - Use uv run ruff, uv run complexipy for tool execution
async-python-patterns - Reference async patterns when refactoring async code
Edge Cases and Limitations
When NOT to Refactor: Performance-critical optimized code (profile first), code scheduled for deletion, external dependencies (contribute upstream), stable legacy code nobody needs to modify.
Limitations: Cannot improve algorithmic complexity (that's algorithm change, not refactoring). Cannot add domain knowledge not in code/comments. Cannot guarantee correctness without tests. Code style preferences vary - adjust based on team conventions.
Examples
See references/examples/ for before/after examples:
script_to_oop_transformation.md - Complete transformation from script-like code to clean OOP architecture
python_complexity_reduction.md - Nested conditionals and long functions
typescript_naming_improvements.md - Variable and function naming patterns (cross-language reference)
Success Criteria
Refactoring is successful when:
ZERO regressions - All existing tests pass, behavior unchanged
Golden master match - Identical output for documented critical cases
Complexity metrics improved (documented in summary)
No performance regression >10% (or explicit approval obtained)
Documentation coverage improved
Code is easier for humans to understand
No new security vulnerabilities introduced
Changes are atomic and well-documented in git history
Wily trend - Complexity not increased compared to previous commit