决策框架技能：AI智能体决策规划与评分方法，优化战略分析与执行计划

Decision Frameworks by bejranonda/llm-autonomous-agent-plugin-for-claude

18 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/bejranonda/llm-autonomous-agent-plugin-for-claude --skill 'Decision Frameworks'

AI/机器学习自动化系统架构

🇨🇳中文介绍

决策框架技能

概述

本技能提供决策框架、评分方法和规划策略，专门针对四层架构中的 第 2 组（决策与规划） 智能体。它涵盖如何评估第 1 组的建议、纳入用户偏好、创建执行计划，以及做出平衡多种因素的最佳决策。

何时应用此技能

在以下情况使用此技能：

评估来自第 1 组（战略分析与情报）的建议
为第 3 组（执行与实施）创建执行计划
对相互竞争的建议进行优先级排序
将用户偏好纳入决策
权衡利弊（速度 vs 质量，风险 vs 收益）
在多个有效方法之间做出选择
针对特定目标（质量、速度、成本）进行优化

适用于：

strategic-planner（主决策者）
preference-coordinator（用户偏好专家）
任何进行规划决策的第 2 组智能体

第 2 组角色回顾

第 2 组：决策与规划（"理事会"）

输入：来自第 1 组的建议及其置信度分数
过程：评估、优先级排序、决策、规划
输出：提供给第 3 组的带有优先级和偏好的执行计划
关键职责：做出平衡分析、用户偏好、历史成功率和风险的最佳决策

决策框架

框架 1：建议评估矩阵

目的：从多个维度对每个第 1 组的建议进行评分

评分公式（0-100）：

Recommendation Score =
  (Confidence from Group 1    × 30%) +  # 分析师的置信度如何？
  (User Preference Alignment  × 25%) +  # 是否符合用户风格？
  (Historical Success Rate    × 25%) +  # 之前是否成功过？
  (Risk Assessment            × 20%)    # 风险水平如何？

Where each component is 0-100

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

框架 2：多标准决策分析（MCDA）

目的：在多个相互竞争的建议之间做出选择

方法：跨标准的加权评分

示例 - 在 3 种重构方法之间选择：

criteria = {
    "quality_impact": 0.30,      # 质量将提升多少？
    "effort_required": 0.25,     # 需要多少时间/工作量？
    "risk_level": 0.20,          # 风险有多大？
    "user_alignment": 0.15,      # 是否符合用户风格？
    "maintainability": 0.10      # 长期效益如何？
}

options = [
    {
        "name": "Modular Refactoring",
        "quality_impact": 90,
        "effort_required": 60,  # Higher effort → lower score
        "risk_level": 80,  # Lower risk → higher score
        "user_alignment": 85,
        "maintainability": 95
    },
    {
        "name": "Incremental Refactoring",
        "quality_impact": 70,
        "effort_required": 85,  # Lower effort → higher score
        "risk_level": 90,
        "user_alignment": 90,
        "maintainability": 75
    },
    {
        "name": "Complete Rewrite",
        "quality_impact": 100,
        "effort_required": 20,  # Very high effort → very low score
        "risk_level": 40,  # High risk → low score
        "user_alignment": 60,
        "maintainability": 100
    }
]

def calculate_mcda_score(option, criteria):
    score = 0
    for criterion, weight in criteria.items():
        score += option[criterion] * weight
    return score

scores = {opt["name"]: calculate_mcda_score(opt, criteria) for opt in options}
# Result:
# Modular Refactoring: 82.5
# Incremental Refactoring: 81.0
# Complete Rewrite: 63.0
# → Choose Modular Refactoring

根据用户偏好调整标准权重
将所有分数归一化到 0-100 范围
考虑负面标准（工作量、风险）的逆关系
记录所用权重的理由

框架 3：风险收益分析

目的：通过风险收益视角评估决策

         Low Benefit    |    High Benefit
---------|---------------|------------------
Low Risk | ⚠️ Avoid      | ✅ Do It (Quick Win)
High Risk| ❌ Never Do   | 🤔 Careful Analysis Required

def categorize_decision(benefit_score, risk_level):
    """
    benefit_score: 0-100 (higher = more benefit)
    risk_level: 0-100 (higher = more risky)
    """
    high_benefit = benefit_score >= 70
    low_risk = risk_level <= 30

    if high_benefit and low_risk:
        return "quick_win", "High benefit, low risk - proceed immediately"
    elif high_benefit and not low_risk:
        return "high_value_high_risk", "Requires careful analysis and mitigation strategies"
    elif not high_benefit and low_risk:
        return "avoid", "Not worth the effort even if safe"
    else:
        return "never_do", "High risk, low benefit - reject"

需考虑的风险因素：

技术风险：破坏性变更、向后兼容性、依赖问题
进度风险：可能延迟其他任务、未知复杂性
质量风险：可能引入错误、可能降低测试覆盖率
用户影响：干扰用户工作流程、显著改变行为
可逆性：如果失败，我们能否撤销？

需考虑的收益因素：

质量影响：提高代码质量、减少技术债务
性能影响：使系统更快、更高效
可维护性影响：更易于维护和扩展
用户体验影响：更好的用户体验、更少的错误
战略价值：与长期目标保持一致

框架 4：优先级矩阵（艾森豪威尔矩阵）

目的：根据紧急性和重要性对多个任务进行优先级排序

           Not Urgent    |     Urgent
-----------|---------------|------------------
Important  | 📋 Schedule   | 🔥 Do First
Not Import | 🗑️ Eliminate  | ⚡ Delegate/Quick

def prioritize_tasks(recommendations):
    prioritized = {
        "do_first": [],      # Urgent + Important
        "schedule": [],      # Not Urgent + Important
        "quick_wins": [],    # Urgent + Not Important
        "eliminate": []      # Not Urgent + Not Important
    }

    for rec in recommendations:
        urgent = (
            rec.get("priority") == "high" or
            rec.get("severity") in ["critical", "high"] or
            rec.get("user_impact") == "high"
        )

        important = (
            rec.get("expected_impact") == "high" or
            rec.get("quality_impact") >= 15 or
            rec.get("strategic_value") == "high"
        )

        if urgent and important:
            prioritized["do_first"].append(rec)
        elif not urgent and important:
            prioritized["schedule"].append(rec)
        elif urgent and not important:
            prioritized["quick_wins"].append(rec)
        else:
            prioritized["eliminate"].append(rec)

    return prioritized

首先执行（紧急 + 重要）- 立即执行
快速胜利（紧急 + 不重要）- 如果时间允许则执行
安排计划（不紧急 + 重要）- 为未来迭代做计划
消除（不紧急 + 不重要）- 拒绝或无限期推迟

偏好一致性评分

目的：量化建议与用户偏好的匹配程度

def calculate_preference_alignment(recommendation, user_prefs):
    """
    Returns 0-100 score for preference alignment
    """
    alignment_score = 0
    total_weight = 0

    # 1. Coding Style Alignment (25 points)
    coding_style_weight = 25
    total_weight += coding_style_weight

    if recommendation.get("verbosity") == user_prefs.get("coding_style", {}).get("verbosity"):
        alignment_score += coding_style_weight
    elif abs(verbosity_scale(recommendation.get("verbosity")) -
             verbosity_scale(user_prefs.get("coding_style", {}).get("verbosity"))) <= 1:
        alignment_score += coding_style_weight * 0.7  # Partial credit

    # 2. Quality Priority Alignment (30 points)
    quality_weight = 30
    total_weight += quality_weight

    user_quality_priorities = user_prefs.get("quality_priorities", {})
    rec_quality_focus = recommendation.get("quality_focus", [])

    # Check if recommendation focuses on user's top priorities
    matches = len([p for p in rec_quality_focus if user_quality_priorities.get(p, 0) >= 0.7])
    if matches > 0:
        alignment_score += quality_weight * (matches / len(rec_quality_focus))

    # 3. Workflow Compatibility (25 points)
    workflow_weight = 25
    total_weight += workflow_weight

    # Check auto-fix threshold
    if recommendation.get("confidence", 0) >= user_prefs.get("workflow", {}).get("auto_fix_threshold", 0.85):
        alignment_score += workflow_weight
    elif recommendation.get("confidence", 0) >= user_prefs.get("workflow", {}).get("auto_fix_threshold", 0.85) - 0.1:
        alignment_score += workflow_weight * 0.5

    # 4. Communication Style Alignment (20 points)
    comm_weight = 20
    total_weight += comm_weight

    rec_detail = recommendation.get("detail_level", "balanced")
    user_detail = user_prefs.get("communication", {}).get("detail_level", "balanced")

    if rec_detail == user_detail:
        alignment_score += comm_weight
    elif abs(detail_scale(rec_detail) - detail_scale(user_detail)) <= 1:
        alignment_score += comm_weight * 0.6

    return alignment_score

基于偏好的计划调整

目的：调整执行计划以匹配用户偏好

def adjust_plan_for_preferences(plan, user_prefs):
    """
    Modify execution plan to incorporate user preferences
    """
    adjusted_plan = plan.copy()

    # Adjust coding style
    if user_prefs.get("coding_style", {}).get("verbosity") == "concise":
        adjusted_plan["style_instructions"] = {
            "comments": "minimal",
            "docstrings": "one_line_only",
            "variable_names": "short_but_clear"
        }
    elif user_prefs.get("coding_style", {}).get("verbosity") == "verbose":
        adjusted_plan["style_instructions"] = {
            "comments": "extensive",
            "docstrings": "detailed_with_examples",
            "variable_names": "descriptive"
        }

    # Adjust quality targets based on user priorities
    quality_prefs = user_prefs.get("quality_priorities", {})
    adjusted_plan["quality_targets"] = {
        "tests": 70 + (quality_prefs.get("tests", 0.5) * 30),  # 70-100
        "documentation": 60 + (quality_prefs.get("documentation", 0.5) * 40),  # 60-100
        "code_quality": 75 + (quality_prefs.get("code_quality", 0.5) * 25)  # 75-100
    }

    # Adjust risk tolerance
    risk_tolerance = user_prefs.get("workflow", {}).get("risk_tolerance", "medium")
    if risk_tolerance == "low":
        adjusted_plan["constraints"]["max_auto_fix"] = 0.95  # Only very safe fixes
        adjusted_plan["require_confirmation"] = True
    elif risk_tolerance == "high":
        adjusted_plan["constraints"]["max_auto_fix"] = 0.75  # More aggressive fixes
        adjusted_plan["require_confirmation"] = False

    return adjusted_plan

框架：平衡的权衡评估

速度 vs 质量
风险 vs 收益
短期 vs 长期
简单性 vs 灵活性
性能 vs 可读性

def analyze_trade_offs(recommendation):
    """
    Identify and evaluate trade-offs in a recommendation
    """
    trade_offs = []

    # Trade-off 1: Speed vs Quality
    if recommendation.get("estimated_effort_hours", 0) < 2:
        # Quick implementation
        trade_offs.append({
            "type": "speed_vs_quality",
            "chosen": "speed",
            "gain": "Fast implementation, quick delivery",
            "cost": "May not achieve highest quality, might need refinement later",
            "acceptable": True  # Generally acceptable for small changes
        })

    # Trade-off 2: Risk vs Benefit
    benefit_score = recommendation.get("expected_impact_score", 50)
    risk_score = recommendation.get("risk_score", 50)

    if benefit_score > 80 and risk_score > 60:
        trade_offs.append({
            "type": "risk_vs_benefit",
            "chosen": "benefit",
            "gain": f"High benefit ({benefit_score}/100)",
            "cost": f"Moderate to high risk ({risk_score}/100)",
            "acceptable": benefit_score > risk_score * 1.3,  # Benefit outweighs risk by 30%+
            "mitigation": "Add extra testing, implement in phases, have rollback plan"
        })

    # Trade-off 3: Short-term vs Long-term
    if recommendation.get("type") == "quick_fix" and recommendation.get("technical_debt_added", 0) > 0:
        trade_offs.append({
            "type": "short_term_vs_long_term",
            "chosen": "short_term",
            "gain": "Immediate problem resolution",
            "cost": "Adds technical debt, will need proper fix later",
            "acceptable": recommendation.get("severity") == "critical",  # OK for critical fixes
            "followup": "Schedule proper refactoring in next sprint"
        })

    return trade_offs

def should_accept_trade_off(trade_off, user_prefs):
    """
    Decide if a trade-off is acceptable
    """
    # Check if user preferences lean toward chosen side
    if trade_off["type"] == "speed_vs_quality":
        if user_prefs.get("workflow", {}).get("prefer_speed"):
            return True
        elif user_prefs.get("quality_priorities", {}).get("code_quality", 0.5) > 0.8:
            return False  # User prioritizes quality

    # Check if gains outweigh costs
    if trade_off.get("gain_score", 0) > trade_off.get("cost_score", 0) * 1.5:
        return True  # 50% more gain than cost

    # Check if mitigation strategies exist
    if trade_off.get("mitigation") and len(trade_off.get("mitigation", "")) > 10:
        return True  # Has mitigation plan

    return trade_off.get("acceptable", False)

策略 1：增量执行计划

何时使用：大型变更、高风险或复杂重构

incremental_plan = {
    "approach": "incremental",
    "phases": [
        {
            "phase": 1,
            "name": "Foundation",
            "tasks": ["Extract core functions", "Add tests for extracted functions"],
            "duration_hours": 2,
            "validation_criteria": "All tests pass, coverage ≥ 80%",
            "rollback_plan": "Revert extraction if tests fail"
        },
        {
            "phase": 2,
            "name": "Integration",
            "tasks": ["Update callers to use extracted functions", "Add integration tests"],
            "duration_hours": 1.5,
            "validation_criteria": "No regressions, all integration tests pass",
            "rollback_plan": "Keep old functions as fallback"
        },
        {
            "phase": 3,
            "name": "Cleanup",
            "tasks": ["Remove old code", "Update documentation"],
            "duration_hours": 0.5,
            "validation_criteria": "No dead code, docs updated",
            "rollback_plan": "None needed - previous phases validated"
        }
    ],
    "total_duration_hours": 4,
    "checkpoint_frequency": "after_each_phase"
}

风险更低（每个阶段后验证）
如果出现问题可以提前停止
更容易调试问题
更适合学习（每个阶段后获得反馈）

策略 2：全面执行计划

何时使用：理解充分的变更、低风险、小范围

comprehensive_plan = {
    "approach": "comprehensive",
    "tasks": [
        {
            "task": "Refactor authentication module",
            "subtasks": [
                "Extract validation logic",
                "Extract authentication logic",
                "Extract authorization logic",
                "Add tests for all components",
                "Update callers",
                "Remove old code",
                "Update documentation"
            ],
            "duration_hours": 4,
            "validation_criteria": "All tests pass, coverage ≥ 80%, no regressions"
        }
    ],
    "checkpoint_frequency": "at_end_only"
}

执行更快（无阶段开销）
协调更简单
适合常规变更

策略 3：并行执行计划

何时使用：可以同时进行的独立变更

parallel_plan = {
    "approach": "parallel",
    "parallel_tracks": [
        {
            "track": "backend",
            "agent": "quality-controller",
            "tasks": ["Refactor API endpoints", "Add backend tests"],
            "duration_hours": 3
        },
        {
            "track": "frontend",
            "agent": "frontend-analyzer",
            "tasks": ["Update React components", "Add frontend tests"],
            "duration_hours": 2.5
        },
        {
            "track": "documentation",
            "agent": "documentation-generator",
            "tasks": ["Update API docs", "Update user guide"],
            "duration_hours": 1
        }
    ],
    "coordination_points": [
        {
            "after_hours": 2,
            "sync": "Ensure API contract matches frontend expectations"
        }
    ],
    "total_duration_hours": 3  # Max of parallel tracks
}

总时间最快
高效利用多个智能体
适合全栈变更

协调复杂性
如果同步不当可能导致集成问题

框架：根据上下文调整置信度

目的：根据额外因素校准建议置信度

def calibrate_confidence(recommendation, context):
    """
    Adjust recommendation confidence based on context
    Returns adjusted confidence (0.0-1.0)
    """
    base_confidence = recommendation.get("confidence", 0.5)

    # Adjustment factors
    adjustments = []

    # 1. Historical success with similar tasks
    similar_tasks = query_similar_tasks(recommendation)
    if similar_tasks:
        success_rate = sum(t.success for t in similar_tasks) / len(similar_tasks)
        if success_rate >= 0.9:
            adjustments.append(("high_historical_success", +0.1))
        elif success_rate <= 0.5:
            adjustments.append(("low_historical_success", -0.15))

    # 2. Untested pattern penalty
    pattern_reuse = recommendation.get("pattern_reuse_count", 0)
    if pattern_reuse == 0:
        adjustments.append(("untested_pattern", -0.1))
    elif pattern_reuse >= 5:
        adjustments.append(("proven_pattern", +0.05))

    # 3. Complexity factor
    complexity = recommendation.get("complexity", "medium")
    if complexity == "high":
        adjustments.append(("high_complexity", -0.1))
    elif complexity == "low":
        adjustments.append(("low_complexity", +0.05))

    # 4. User preference mismatch
    pref_alignment = calculate_preference_alignment(recommendation, context.get("user_prefs", {}))
    if pref_alignment < 50:
        adjustments.append(("low_preference_alignment", -0.08))

    # Apply adjustments
    adjusted_confidence = base_confidence
    for reason, delta in adjustments:
        adjusted_confidence += delta

    # Clamp to [0.0, 1.0]
    adjusted_confidence = max(0.0, min(1.0, adjusted_confidence))

    return {
        "original_confidence": base_confidence,
        "adjusted_confidence": adjusted_confidence,
        "adjustments": adjustments
    }

框架：记录每个决策

目的：为用户和学习创建透明、可理解的决策

from lib.decision_explainer import create_explanation

def create_decision_explanation(decision, recommendations, user_prefs, historical_data):
    """
    Create comprehensive explanation for a decision
    """
    explanation = create_explanation(
        decision_id=f"decision_{task_id}",
        decision=decision,
        recommendations=recommendations,
        user_preferences=user_prefs,
        historical_data=historical_data,
        context={
            "task_type": "refactoring",
            "complexity": "medium"
        }
    )

    return explanation
    # Returns:
    # - why_chosen: Primary reasons for this decision
    # - why_not_alternatives: Why other options rejected
    # - trade_offs: What was gained vs what was sacrificed
    # - confidence_factors: What increases/decreases confidence
    # - user_alignment: How decision aligns with user preferences
    # - analogy: Human-friendly comparison

示例解释输出：

## Decision: Modular Refactoring Approach

### Why This Decision?
**Primary Reason**: Highest combined score (82.5/100) balancing quality impact, effort, and risk.

**Supporting Reasons**:
1. Strong quality improvement potential (90/100)
2. Manageable effort (60/100 - approximately 4 hours)
3. Low risk with clear rollback options (80/100)
4. Excellent maintainability benefits (95/100)

### Why Not Alternatives?
**Incremental Refactoring (Score: 81.0)**: Close second, but lower quality impact (70 vs 90). Would take longer to achieve same quality level.

**Complete Rewrite (Score: 63.0)**: Rejected due to:
- Very high effort (20/100 - would take 20+ hours)
- High risk (40/100 - could introduce many bugs)
- Lower user alignment (60/100 - user prefers incremental changes)
Despite perfect quality potential, the risk-benefit ratio is unfavorable.

### Trade-offs Considered
**Time vs Quality**: Choosing modular approach over quick incremental fixes means:
- ✅ Gain: Significantly better long-term code quality
- ⚠️ Cost: Takes 1.5x longer than incremental approach
- ✓ Acceptable: Quality improvement worth the extra time

**Risk vs Benefit**: Moderate complexity with high reward:
- ✅ Gain: 90/100 quality improvement potential
- ⚠️ Cost: Some architectural risk in module boundaries
- ✓ Mitigation: Incremental implementation with validation checkpoints

### Confidence Factors
**High Confidence (0.88)**:
- ✓ Similar pattern succeeded 5 times previously (100% success rate)
- ✓ Strong alignment with user preferences (85/100)
- ⚠️ Moderate complexity reduces confidence slightly (-0.05)

### User Preference Alignment
- Coding Style: ✓ Matches preference for modular, well-organized code
- Quality Focus: ✓ User prioritizes maintainability (0.85) - this approach excels here
- Risk Tolerance: ✓ Medium risk acceptable for high-quality outcomes

### Analogy
Like reorganizing a messy closet by sorting items into clearly labeled boxes (modular refactoring) rather than just pushing things around (incremental) or building an entirely new closet system (complete rewrite). The sorting approach takes reasonable time, dramatically improves organization, and can be done safely one section at a time.

有效决策的指标：

✅ 决策置信度 > 0.80（有充分支持的决策）
✅ 用户偏好一致性 > 75%（决策符合用户风格）
✅ 执行成功率 > 90%（第 3 组成功执行计划）
✅ 计划调整率 < 20%（执行期间计划不需要重大修订）
✅ 用户满意度 > 85%（用户接受决策）
✅ 决策可解释性分数 > 80%（用户理解决策原因）

from lib.agent_performance_tracker import get_agent_performance

performance = get_agent_performance("strategic-planner")
print(f"Decision success rate: {performance['success_rate']:.1%}")
print(f"Average confidence: {performance['avg_confidence']:.2f}")
print(f"User approval rate: {performance['user_approval_rate']:.1%}")

lib/decision_explainer.py - 决策解释系统
lib/user_preference_learner.py - 用户偏好跟踪
lib/agent_performance_tracker.py - 决策结果跟踪
lib/inter_group_knowledge_transfer.py - 历史成功数据

docs/FOUR_TIER_ARCHITECTURE.md - 完整架构
agents/strategic-planner.md - 主决策者智能体
agents/preference-coordinator.md - 用户偏好专家
skills/group-collaboration/SKILL.md - 组间通信

🇺🇸English

Decision Frameworks Skill

Overview

This skill provides decision-making frameworks, scoring methodologies, and planning strategies specifically for Group 2 (Decision Making & Planning) agents in the four-tier architecture. It covers how to evaluate Group 1 recommendations, incorporate user preferences, create execution plans, and make optimal decisions that balance multiple factors.

When to Apply This Skill

Use this skill when:

Evaluating recommendations from Group 1 (Strategic Analysis & Intelligence)
Creating execution plans for Group 3 (Execution & Implementation)
Prioritizing competing recommendations
Incorporating user preferences into decisions
Balancing trade-offs (speed vs quality, risk vs benefit)
Deciding between multiple valid approaches
Optimizing for specific objectives (quality, speed, cost)

Required for:

strategic-planner (master decision-maker)
preference-coordinator (user preference specialist)
Any Group 2 agent making planning decisions

Group 2 Role Recap

Group 2: Decision Making & Planning (The "Council")

Input : Recommendations from Group 1 with confidence scores
Process : Evaluate, prioritize, decide, plan
Output : Execution plans for Group 3 with priorities and preferences
Key Responsibility : Make optimal decisions balancing analysis, user preferences, historical success, and risk

Decision-Making Frameworks

Framework 1: Recommendation Evaluation Matrix

Purpose : Score each Group 1 recommendation on multiple dimensions

Scoring Formula (0-100) :

Recommendation Score =
  (Confidence from Group 1    × 30%) +  # How confident is the analyst?
  (User Preference Alignment  × 25%) +  # Does it match user style?
  (Historical Success Rate    × 25%) +  # Has this worked before?
  (Risk Assessment            × 20%)    # What's the risk level?

Where each component is 0-100

Implementation :

def evaluate_recommendation(recommendation, user_prefs, historical_data):
    # Component 1: Confidence from Group 1 (0-100)
    confidence_score = recommendation.get("confidence", 0.5) * 100

    # Component 2: User Preference Alignment (0-100)
    preference_score = calculate_preference_alignment(
        recommendation,
        user_prefs
    )

    # Component 3: Historical Success Rate (0-100)
    similar_patterns = query_similar_tasks(recommendation)
    if similar_patterns:
        success_rate = sum(p.success for p in similar_patterns) / len(similar_patterns)
        historical_score = success_rate * 100
    else:
        historical_score = 50  # No data → neutral

    # Component 4: Risk Assessment (0-100, higher = safer)
    risk_score = assess_risk(recommendation)

    # Weighted average
    total_score = (
        confidence_score * 0.30 +
        preference_score * 0.25 +
        historical_score * 0.25 +
        risk_score * 0.20
    )

    return {
        "total_score": total_score,
        "confidence_score": confidence_score,
        "preference_score": preference_score,
        "historical_score": historical_score,
        "risk_score": risk_score
    }

Interpretation :

85-100 : Excellent recommendation - high confidence to proceed
70-84 : Good recommendation - proceed with standard caution
50-69 : Moderate recommendation - proceed carefully or seek alternatives
0-49 : Weak recommendation - consider rejecting or modifying significantly

Framework 2: Multi-Criteria Decision Analysis (MCDA)

Purpose : Choose between multiple competing recommendations

Method : Weighted scoring across criteria

Example - Choosing Between 3 Refactoring Approaches :

criteria = {
    "quality_impact": 0.30,      # How much will quality improve?
    "effort_required": 0.25,     # How much time/work?
    "risk_level": 0.20,          # How risky is it?
    "user_alignment": 0.15,      # Matches user style?
    "maintainability": 0.10      # Long-term benefits?
}

options = [
    {
        "name": "Modular Refactoring",
        "quality_impact": 90,
        "effort_required": 60,  # Higher effort → lower score
        "risk_level": 80,  # Lower risk → higher score
        "user_alignment": 85,
        "maintainability": 95
    },
    {
        "name": "Incremental Refactoring",
        "quality_impact": 70,
        "effort_required": 85,  # Lower effort → higher score
        "risk_level": 90,
        "user_alignment": 90,
        "maintainability": 75
    },
    {
        "name": "Complete Rewrite",
        "quality_impact": 100,
        "effort_required": 20,  # Very high effort → very low score
        "risk_level": 40,  # High risk → low score
        "user_alignment": 60,
        "maintainability": 100
    }
]

def calculate_mcda_score(option, criteria):
    score = 0
    for criterion, weight in criteria.items():
        score += option[criterion] * weight
    return score

scores = {opt["name"]: calculate_mcda_score(opt, criteria) for opt in options}
# Result:
# Modular Refactoring: 82.5
# Incremental Refactoring: 81.0
# Complete Rewrite: 63.0
# → Choose Modular Refactoring

Best Practices :

Adjust criterion weights based on user preferences
Normalize all scores to 0-100 range
Consider negative criteria (effort, risk) inversely
Document rationale for weights used

Framework 3: Risk-Benefit Analysis

Purpose : Evaluate decisions through risk-benefit lens

Matrix :

         Low Benefit    |    High Benefit
---------|---------------|------------------
Low Risk | ⚠️ Avoid      | ✅ Do It (Quick Win)
High Risk| ❌ Never Do   | 🤔 Careful Analysis Required

Implementation :

def categorize_decision(benefit_score, risk_level):
    """
    benefit_score: 0-100 (higher = more benefit)
    risk_level: 0-100 (higher = more risky)
    """
    high_benefit = benefit_score >= 70
    low_risk = risk_level <= 30

    if high_benefit and low_risk:
        return "quick_win", "High benefit, low risk - proceed immediately"
    elif high_benefit and not low_risk:
        return "high_value_high_risk", "Requires careful analysis and mitigation strategies"
    elif not high_benefit and low_risk:
        return "avoid", "Not worth the effort even if safe"
    else:
        return "never_do", "High risk, low benefit - reject"

Risk Factors to Consider :

Technical Risk : Breaking changes, backward compatibility, dependency issues
Schedule Risk : Could delay other tasks, unknown complexity
Quality Risk : Might introduce bugs, could reduce test coverage
User Impact : Disrupts user workflow, changes behavior significantly
Reversibility : Can we undo if it fails?

Benefit Factors to Consider :

Quality Impact : Improves code quality, reduces technical debt
Performance Impact : Makes system faster, more efficient
Maintainability Impact : Easier to maintain and extend
User Experience Impact : Better UX, fewer errors
Strategic Value : Aligns with long-term goals

Framework 4: Prioritization Matrix (Eisenhower Matrix)

Purpose : Prioritize multiple tasks by urgency and importance

Matrix :

           Not Urgent    |     Urgent
-----------|---------------|------------------
Important  | 📋 Schedule   | 🔥 Do First
Not Import | 🗑️ Eliminate  | ⚡ Delegate/Quick

Implementation :

def prioritize_tasks(recommendations):
    prioritized = {
        "do_first": [],      # Urgent + Important
        "schedule": [],      # Not Urgent + Important
        "quick_wins": [],    # Urgent + Not Important
        "eliminate": []      # Not Urgent + Not Important
    }

    for rec in recommendations:
        urgent = (
            rec.get("priority") == "high" or
            rec.get("severity") in ["critical", "high"] or
            rec.get("user_impact") == "high"
        )

        important = (
            rec.get("expected_impact") == "high" or
            rec.get("quality_impact") >= 15 or
            rec.get("strategic_value") == "high"
        )

        if urgent and important:
            prioritized["do_first"].append(rec)
        elif not urgent and important:
            prioritized["schedule"].append(rec)
        elif urgent and not important:
            prioritized["quick_wins"].append(rec)
        else:
            prioritized["eliminate"].append(rec)

    return prioritized

Execution Order :

Do First (Urgent + Important) - Execute immediately
Quick Wins (Urgent + Not Important) - Execute if time permits
Schedule (Not Urgent + Important) - Plan for future iteration
Eliminate (Not Urgent + Not Important) - Reject or defer indefinitely

User Preference Integration

Preference Alignment Scoring

Purpose : Quantify how well a recommendation matches user preferences

Implementation :

def calculate_preference_alignment(recommendation, user_prefs):
    """
    Returns 0-100 score for preference alignment
    """
    alignment_score = 0
    total_weight = 0

    # 1. Coding Style Alignment (25 points)
    coding_style_weight = 25
    total_weight += coding_style_weight

    if recommendation.get("verbosity") == user_prefs.get("coding_style", {}).get("verbosity"):
        alignment_score += coding_style_weight
    elif abs(verbosity_scale(recommendation.get("verbosity")) -
             verbosity_scale(user_prefs.get("coding_style", {}).get("verbosity"))) <= 1:
        alignment_score += coding_style_weight * 0.7  # Partial credit

    # 2. Quality Priority Alignment (30 points)
    quality_weight = 30
    total_weight += quality_weight

    user_quality_priorities = user_prefs.get("quality_priorities", {})
    rec_quality_focus = recommendation.get("quality_focus", [])

    # Check if recommendation focuses on user's top priorities
    matches = len([p for p in rec_quality_focus if user_quality_priorities.get(p, 0) >= 0.7])
    if matches > 0:
        alignment_score += quality_weight * (matches / len(rec_quality_focus))

    # 3. Workflow Compatibility (25 points)
    workflow_weight = 25
    total_weight += workflow_weight

    # Check auto-fix threshold
    if recommendation.get("confidence", 0) >= user_prefs.get("workflow", {}).get("auto_fix_threshold", 0.85):
        alignment_score += workflow_weight
    elif recommendation.get("confidence", 0) >= user_prefs.get("workflow", {}).get("auto_fix_threshold", 0.85) - 0.1:
        alignment_score += workflow_weight * 0.5

    # 4. Communication Style Alignment (20 points)
    comm_weight = 20
    total_weight += comm_weight

    rec_detail = recommendation.get("detail_level", "balanced")
    user_detail = user_prefs.get("communication", {}).get("detail_level", "balanced")

    if rec_detail == user_detail:
        alignment_score += comm_weight
    elif abs(detail_scale(rec_detail) - detail_scale(user_detail)) <= 1:
        alignment_score += comm_weight * 0.6

    return alignment_score

Preference-Based Plan Adjustment

Purpose : Adjust execution plan to match user preferences

Example :

def adjust_plan_for_preferences(plan, user_prefs):
    """
    Modify execution plan to incorporate user preferences
    """
    adjusted_plan = plan.copy()

    # Adjust coding style
    if user_prefs.get("coding_style", {}).get("verbosity") == "concise":
        adjusted_plan["style_instructions"] = {
            "comments": "minimal",
            "docstrings": "one_line_only",
            "variable_names": "short_but_clear"
        }
    elif user_prefs.get("coding_style", {}).get("verbosity") == "verbose":
        adjusted_plan["style_instructions"] = {
            "comments": "extensive",
            "docstrings": "detailed_with_examples",
            "variable_names": "descriptive"
        }

    # Adjust quality targets based on user priorities
    quality_prefs = user_prefs.get("quality_priorities", {})
    adjusted_plan["quality_targets"] = {
        "tests": 70 + (quality_prefs.get("tests", 0.5) * 30),  # 70-100
        "documentation": 60 + (quality_prefs.get("documentation", 0.5) * 40),  # 60-100
        "code_quality": 75 + (quality_prefs.get("code_quality", 0.5) * 25)  # 75-100
    }

    # Adjust risk tolerance
    risk_tolerance = user_prefs.get("workflow", {}).get("risk_tolerance", "medium")
    if risk_tolerance == "low":
        adjusted_plan["constraints"]["max_auto_fix"] = 0.95  # Only very safe fixes
        adjusted_plan["require_confirmation"] = True
    elif risk_tolerance == "high":
        adjusted_plan["constraints"]["max_auto_fix"] = 0.75  # More aggressive fixes
        adjusted_plan["require_confirmation"] = False

    return adjusted_plan

Trade-Off Analysis

Framework: Balanced Trade-Off Evaluation

Common Trade-Offs :

Speed vs Quality
Risk vs Benefit
Short-term vs Long-term
Simplicity vs Flexibility
Performance vs Readability

Implementation :

def analyze_trade_offs(recommendation):
    """
    Identify and evaluate trade-offs in a recommendation
    """
    trade_offs = []

    # Trade-off 1: Speed vs Quality
    if recommendation.get("estimated_effort_hours", 0) < 2:
        # Quick implementation
        trade_offs.append({
            "type": "speed_vs_quality",
            "chosen": "speed",
            "gain": "Fast implementation, quick delivery",
            "cost": "May not achieve highest quality, might need refinement later",
            "acceptable": True  # Generally acceptable for small changes
        })

    # Trade-off 2: Risk vs Benefit
    benefit_score = recommendation.get("expected_impact_score", 50)
    risk_score = recommendation.get("risk_score", 50)

    if benefit_score > 80 and risk_score > 60:
        trade_offs.append({
            "type": "risk_vs_benefit",
            "chosen": "benefit",
            "gain": f"High benefit ({benefit_score}/100)",
            "cost": f"Moderate to high risk ({risk_score}/100)",
            "acceptable": benefit_score > risk_score * 1.3,  # Benefit outweighs risk by 30%+
            "mitigation": "Add extra testing, implement in phases, have rollback plan"
        })

    # Trade-off 3: Short-term vs Long-term
    if recommendation.get("type") == "quick_fix" and recommendation.get("technical_debt_added", 0) > 0:
        trade_offs.append({
            "type": "short_term_vs_long_term",
            "chosen": "short_term",
            "gain": "Immediate problem resolution",
            "cost": "Adds technical debt, will need proper fix later",
            "acceptable": recommendation.get("severity") == "critical",  # OK for critical fixes
            "followup": "Schedule proper refactoring in next sprint"
        })

    return trade_offs

Decision Rule :

def should_accept_trade_off(trade_off, user_prefs):
    """
    Decide if a trade-off is acceptable
    """
    # Check if user preferences lean toward chosen side
    if trade_off["type"] == "speed_vs_quality":
        if user_prefs.get("workflow", {}).get("prefer_speed"):
            return True
        elif user_prefs.get("quality_priorities", {}).get("code_quality", 0.5) > 0.8:
            return False  # User prioritizes quality

    # Check if gains outweigh costs
    if trade_off.get("gain_score", 0) > trade_off.get("cost_score", 0) * 1.5:
        return True  # 50% more gain than cost

    # Check if mitigation strategies exist
    if trade_off.get("mitigation") and len(trade_off.get("mitigation", "")) > 10:
        return True  # Has mitigation plan

    return trade_off.get("acceptable", False)

Planning Strategies

Strategy 1: Incremental Execution Plan

When to Use : Large changes, high risk, or complex refactoring

Structure :

incremental_plan = {
    "approach": "incremental",
    "phases": [
        {
            "phase": 1,
            "name": "Foundation",
            "tasks": ["Extract core functions", "Add tests for extracted functions"],
            "duration_hours": 2,
            "validation_criteria": "All tests pass, coverage ≥ 80%",
            "rollback_plan": "Revert extraction if tests fail"
        },
        {
            "phase": 2,
            "name": "Integration",
            "tasks": ["Update callers to use extracted functions", "Add integration tests"],
            "duration_hours": 1.5,
            "validation_criteria": "No regressions, all integration tests pass",
            "rollback_plan": "Keep old functions as fallback"
        },
        {
            "phase": 3,
            "name": "Cleanup",
            "tasks": ["Remove old code", "Update documentation"],
            "duration_hours": 0.5,
            "validation_criteria": "No dead code, docs updated",
            "rollback_plan": "None needed - previous phases validated"
        }
    ],
    "total_duration_hours": 4,
    "checkpoint_frequency": "after_each_phase"
}

Benefits :

Lower risk (validate after each phase)
Can stop early if issues arise
Easier to debug problems
Better for learning (feedback after each phase)

Strategy 2: Comprehensive Execution Plan

When to Use : Well-understood changes, low risk, small scope

Structure :

comprehensive_plan = {
    "approach": "comprehensive",
    "tasks": [
        {
            "task": "Refactor authentication module",
            "subtasks": [
                "Extract validation logic",
                "Extract authentication logic",
                "Extract authorization logic",
                "Add tests for all components",
                "Update callers",
                "Remove old code",
                "Update documentation"
            ],
            "duration_hours": 4,
            "validation_criteria": "All tests pass, coverage ≥ 80%, no regressions"
        }
    ],
    "checkpoint_frequency": "at_end_only"
}

Benefits :

Faster execution (no phase overhead)
Simpler coordination
Good for routine changes

Strategy 3: Parallel Execution Plan

When to Use : Independent changes that can happen simultaneously

Structure :

parallel_plan = {
    "approach": "parallel",
    "parallel_tracks": [
        {
            "track": "backend",
            "agent": "quality-controller",
            "tasks": ["Refactor API endpoints", "Add backend tests"],
            "duration_hours": 3
        },
        {
            "track": "frontend",
            "agent": "frontend-analyzer",
            "tasks": ["Update React components", "Add frontend tests"],
            "duration_hours": 2.5
        },
        {
            "track": "documentation",
            "agent": "documentation-generator",
            "tasks": ["Update API docs", "Update user guide"],
            "duration_hours": 1
        }
    ],
    "coordination_points": [
        {
            "after_hours": 2,
            "sync": "Ensure API contract matches frontend expectations"
        }
    ],
    "total_duration_hours": 3  # Max of parallel tracks
}

Benefits :

Fastest total time
Efficient use of multiple agents
Good for full-stack changes

Risks :

Coordination complexity
Integration issues if not synced properly

Confidence Calibration

Framework: Adjust Confidence Based on Context

Purpose : Calibrate recommendation confidence based on additional factors

Implementation :

def calibrate_confidence(recommendation, context):
    """
    Adjust recommendation confidence based on context
    Returns adjusted confidence (0.0-1.0)
    """
    base_confidence = recommendation.get("confidence", 0.5)

    # Adjustment factors
    adjustments = []

    # 1. Historical success with similar tasks
    similar_tasks = query_similar_tasks(recommendation)
    if similar_tasks:
        success_rate = sum(t.success for t in similar_tasks) / len(similar_tasks)
        if success_rate >= 0.9:
            adjustments.append(("high_historical_success", +0.1))
        elif success_rate <= 0.5:
            adjustments.append(("low_historical_success", -0.15))

    # 2. Untested pattern penalty
    pattern_reuse = recommendation.get("pattern_reuse_count", 0)
    if pattern_reuse == 0:
        adjustments.append(("untested_pattern", -0.1))
    elif pattern_reuse >= 5:
        adjustments.append(("proven_pattern", +0.05))

    # 3. Complexity factor
    complexity = recommendation.get("complexity", "medium")
    if complexity == "high":
        adjustments.append(("high_complexity", -0.1))
    elif complexity == "low":
        adjustments.append(("low_complexity", +0.05))

    # 4. User preference mismatch
    pref_alignment = calculate_preference_alignment(recommendation, context.get("user_prefs", {}))
    if pref_alignment < 50:
        adjustments.append(("low_preference_alignment", -0.08))

    # Apply adjustments
    adjusted_confidence = base_confidence
    for reason, delta in adjustments:
        adjusted_confidence += delta

    # Clamp to [0.0, 1.0]
    adjusted_confidence = max(0.0, min(1.0, adjusted_confidence))

    return {
        "original_confidence": base_confidence,
        "adjusted_confidence": adjusted_confidence,
        "adjustments": adjustments
    }

Decision Explainability

Framework: Document Every Decision

Purpose : Create transparent, understandable decisions for users and learning

Implementation :

from lib.decision_explainer import create_explanation

def create_decision_explanation(decision, recommendations, user_prefs, historical_data):
    """
    Create comprehensive explanation for a decision
    """
    explanation = create_explanation(
        decision_id=f"decision_{task_id}",
        decision=decision,
        recommendations=recommendations,
        user_preferences=user_prefs,
        historical_data=historical_data,
        context={
            "task_type": "refactoring",
            "complexity": "medium"
        }
    )

    return explanation
    # Returns:
    # - why_chosen: Primary reasons for this decision
    # - why_not_alternatives: Why other options rejected
    # - trade_offs: What was gained vs what was sacrificed
    # - confidence_factors: What increases/decreases confidence
    # - user_alignment: How decision aligns with user preferences
    # - analogy: Human-friendly comparison

Example Explanation Output :

## Decision: Modular Refactoring Approach

### Why This Decision?
**Primary Reason**: Highest combined score (82.5/100) balancing quality impact, effort, and risk.

**Supporting Reasons**:
1. Strong quality improvement potential (90/100)
2. Manageable effort (60/100 - approximately 4 hours)
3. Low risk with clear rollback options (80/100)
4. Excellent maintainability benefits (95/100)

### Why Not Alternatives?
**Incremental Refactoring (Score: 81.0)**: Close second, but lower quality impact (70 vs 90). Would take longer to achieve same quality level.

**Complete Rewrite (Score: 63.0)**: Rejected due to:
- Very high effort (20/100 - would take 20+ hours)
- High risk (40/100 - could introduce many bugs)
- Lower user alignment (60/100 - user prefers incremental changes)
Despite perfect quality potential, the risk-benefit ratio is unfavorable.

### Trade-offs Considered
**Time vs Quality**: Choosing modular approach over quick incremental fixes means:
- ✅ Gain: Significantly better long-term code quality
- ⚠️ Cost: Takes 1.5x longer than incremental approach
- ✓ Acceptable: Quality improvement worth the extra time

**Risk vs Benefit**: Moderate complexity with high reward:
- ✅ Gain: 90/100 quality improvement potential
- ⚠️ Cost: Some architectural risk in module boundaries
- ✓ Mitigation: Incremental implementation with validation checkpoints

### Confidence Factors
**High Confidence (0.88)**:
- ✓ Similar pattern succeeded 5 times previously (100% success rate)
- ✓ Strong alignment with user preferences (85/100)
- ⚠️ Moderate complexity reduces confidence slightly (-0.05)

### User Preference Alignment
- Coding Style: ✓ Matches preference for modular, well-organized code
- Quality Focus: ✓ User prioritizes maintainability (0.85) - this approach excels here
- Risk Tolerance: ✓ Medium risk acceptable for high-quality outcomes

### Analogy
Like reorganizing a messy closet by sorting items into clearly labeled boxes (modular refactoring) rather than just pushing things around (incremental) or building an entirely new closet system (complete rewrite). The sorting approach takes reasonable time, dramatically improves organization, and can be done safely one section at a time.

Success Metrics

Effective Decision-Making Indicators :

✅ Decision confidence > 0.80 (well-supported decisions)
✅ User preference alignment > 75% (decisions match user style)
✅ Execution success rate > 90% (Group 3 successfully executes plans)
✅ Plan adjustment rate < 20% (plans don't need major revision during execution)
✅ User satisfaction > 85% (users accept decisions)
✅ Decision explainability score > 80% (users understand why decisions were made)

Track with :

from lib.agent_performance_tracker import get_agent_performance

performance = get_agent_performance("strategic-planner")
print(f"Decision success rate: {performance['success_rate']:.1%}")
print(f"Average confidence: {performance['avg_confidence']:.2f}")
print(f"User approval rate: {performance['user_approval_rate']:.1%}")

References

Related Systems :

lib/decision_explainer.py - Decision explanation system
lib/user_preference_learner.py - User preference tracking
lib/agent_performance_tracker.py - Decision outcome tracking
lib/inter_group_knowledge_transfer.py - Historical success data

Related Documentation :

docs/FOUR_TIER_ARCHITECTURE.md - Complete architecture
agents/strategic-planner.md - Master decision-maker agent
agents/preference-coordinator.md - User preference specialist
skills/group-collaboration/SKILL.md - Inter-group communication

Weekly Installs

Repository

bejranonda/llm-…r-claude

GitHub Stars

First Seen

Jan 1, 1970

Security Audits

Gen Agent Trust HubPass SocketPass SnykPass

AI 代码实施计划编写技能 | 自动化开发任务分解与 TDD 流程规划工具

41,400 周安装

决策框架技能：AI智能体决策规划与评分方法，优化战略分析与执行计划

🇨🇳中文介绍

决策框架技能

概述

何时应用此技能

第 2 组角色回顾

决策框架

框架 1：建议评估矩阵

相关 Skills

框架 2：多标准决策分析（MCDA）

框架 3：风险收益分析

框架 4：优先级矩阵（艾森豪威尔矩阵）

用户偏好整合

偏好一致性评分

基于偏好的计划调整

权衡分析

框架：平衡的权衡评估

规划策略

策略 1：增量执行计划

策略 2：全面执行计划

策略 3：并行执行计划

置信度校准

框架：根据上下文调整置信度

决策可解释性

框架：记录每个决策

成功指标

参考

🇺🇸English

Decision Frameworks Skill

Overview

When to Apply This Skill

Group 2 Role Recap

Decision-Making Frameworks

Framework 1: Recommendation Evaluation Matrix

Framework 2: Multi-Criteria Decision Analysis (MCDA)

Framework 3: Risk-Benefit Analysis

Framework 4: Prioritization Matrix (Eisenhower Matrix)

User Preference Integration

Preference Alignment Scoring

Preference-Based Plan Adjustment

Trade-Off Analysis

Framework: Balanced Trade-Off Evaluation

Planning Strategies

Strategy 1: Incremental Execution Plan

Strategy 2: Comprehensive Execution Plan

Strategy 3: Parallel Execution Plan

Confidence Calibration

Framework: Adjust Confidence Based on Context

Decision Explainability

Framework: Document Every Decision

Success Metrics

References

最新 Skills