npx skills add https://github.com/imrkhn03/proactive-tasks --skill proactive-tasks
A task management system that transforms reactive assistants into proactive partners who work autonomously on shared goals.
Instead of waiting for your human to tell you what to do, this skill lets you:
When your human mentions a goal or project:
python3 scripts/task_manager.py add-goal "Build voice assistant hardware" \
--priority high \
--context "Replace Alexa with custom solution using local models"
python3 scripts/task_manager.py add-task "Build voice assistant hardware" \
"Research voice-to-text models" \
--priority high
python3 scripts/task_manager.py add-task "Build voice assistant hardware" \
"Compare Raspberry Pi vs other hardware options" \
--depends-on "Research voice-to-text models"
Check what to work on next:
python3 scripts/task_manager.py next-task
This returns the highest-priority task you can work on (no unmet dependencies, not blocked).
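The selection logic can be sketched as follows. This is a minimal illustration, not the actual `task_manager.py` implementation; the field names (`status`, `priority`, `depends_on`) are assumptions based on the CLI examples above.

```python
# Sketch of next-task selection: the highest-priority pending task
# whose dependencies are all completed. Field names are assumptions.
PRIORITY_RANK = {"high": 0, "medium": 1, "low": 2}

def next_task(tasks):
    done = {t["id"] for t in tasks if t["status"] == "completed"}
    candidates = [
        t for t in tasks
        if t["status"] == "pending"
        and all(dep in done for dep in t.get("depends_on", []))
    ]
    candidates.sort(key=lambda t: PRIORITY_RANK[t["priority"]])
    return candidates[0] if candidates else None

tasks = [
    {"id": "t1", "status": "completed", "priority": "high"},
    {"id": "t2", "status": "pending", "priority": "low"},
    {"id": "t3", "status": "pending", "priority": "high", "depends_on": ["t1"]},
]
print(next_task(tasks)["id"])  # t3: high priority, dependency t1 is done
```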
python3 scripts/task_manager.py complete-task <task-id> \
--notes "Researched Whisper, Coqui, vosk. Whisper.cpp looks best for Pi."
When you complete something important or get blocked:
python3 scripts/task_manager.py mark-needs-input <task-id> \
--reason "Need budget approval for hardware purchase"
Then message your human with the update/question.
Proactive Tasks v1.2.0 includes battle-tested patterns from real agent usage to prevent data loss, survive context truncation, and maintain reliability under autonomous operation.
The Problem: Agents write to memory files, then context gets truncated. Changes vanish.
The Solution: Log critical changes to memory/WAL-YYYY-MM-DD.log BEFORE modifying task data.
How it works:
Any `mark-progress`, `log-time`, or status change creates a WAL entry first. Events logged:

- `PROGRESS_CHANGE`: Task progress updates (0-100%)
- `TIME_LOG`: Actual time spent on tasks
- `STATUS_CHANGE`: Task state transitions (blocked, completed, etc.)
- `HEALTH_CHECK`: Self-healing operations

Automatically enabled - no configuration needed. WAL files are created in the `memory/` directory.
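The write-ahead pattern can be sketched like this: record the intent to disk *before* mutating the task data, so a truncated session can replay the log. The entry format and helper names here are assumptions, not the skill's exact code.

```python
# Sketch of write-ahead logging: the WAL entry is appended BEFORE the
# in-memory task is mutated. Entry format is an assumption.
from datetime import datetime, timezone
from pathlib import Path

def wal_append(event, task_id, detail, memory_dir="memory"):
    Path(memory_dir).mkdir(exist_ok=True)
    stamp = datetime.now(timezone.utc)
    path = Path(memory_dir) / f"WAL-{stamp:%Y-%m-%d}.log"
    with path.open("a") as f:
        f.write(f"{event} ({stamp.isoformat()}): {task_id} -> {detail}\n")

def mark_progress(task, percent, memory_dir="memory"):
    wal_append("PROGRESS_CHANGE", task["id"], f"{percent}%", memory_dir)  # 1. log intent
    task["progress"] = percent                                           # 2. then mutate
```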
The Concept: Chat history is a BUFFER, not storage. SESSION-STATE.md is your "RAM" - the ONLY place task details are reliably preserved.
Auto-updated on every task operation:
## Current Task
- **ID:** task_abc123
- **Title:** Research voice models
- **Status:** in_progress
- **Progress:** 75%
- **Time:** 45 min actual / 60 min estimate (25% faster)
## Next Action
Complete research, document findings in notes, mark complete.
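Regenerating the file above on every operation can be sketched as a simple template render (the task dict shape is assumed; this is not the skill's actual implementation):

```python
# Sketch: rebuild SESSION-STATE.md from the current task after every
# operation. Template mirrors the example above; fields are assumed.
def render_session_state(task, next_action):
    lines = [
        "## Current Task",
        f"- **ID:** {task['id']}",
        f"- **Title:** {task['title']}",
        f"- **Status:** {task['status']}",
        f"- **Progress:** {task['progress']}%",
        "",
        "## Next Action",
        next_action,
    ]
    return "\n".join(lines) + "\n"

state = render_session_state(
    {"id": "task_abc123", "title": "Research voice models",
     "status": "in_progress", "progress": 75},
    "Complete research, document findings in notes, mark complete.",
)
```

Writing the rendered text atomically (write to a temp file, then rename) avoids leaving a half-written state file if the process dies mid-write.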
Why this matters: After context compaction, you can read SESSION-STATE.md and immediately know:
The Problem: Between 60% and 100% context usage, you're in the "danger zone" - compaction could happen any time.
The Solution: Automatically append all task updates to working-buffer.md.
How it works:
# Every progress update, time log, or status change appends:
- PROGRESS_CHANGE (2026-02-12T10:30:00Z): task_abc123 → 75%
- TIME_LOG (2026-02-12T10:35:00Z): task_abc123 → +15 min
- STATUS_CHANGE (2026-02-12T10:40:00Z): task_abc123 → completed
After compaction: Read working-buffer.md to see exactly what happened during the danger zone.
Manual flush: python3 scripts/task_manager.py flush-buffer to copy buffer contents to daily memory file.
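The append-and-flush behavior can be sketched like this. Paths follow the skill's conventions; the record format and daily-file naming are assumptions.

```python
# Sketch of the working buffer: every update appends one line, and
# flush-buffer moves the contents into a daily memory file.
from datetime import datetime, timezone
from pathlib import Path

BUFFER = Path("working-buffer.md")

def buffer_append(event, task_id, detail):
    stamp = datetime.now(timezone.utc).isoformat()
    with BUFFER.open("a") as f:
        f.write(f"- {event} ({stamp}): {task_id} -> {detail}\n")

def flush_buffer(memory_dir="memory"):
    if not BUFFER.exists():
        return
    daily = Path(memory_dir) / f"{datetime.now(timezone.utc):%Y-%m-%d}.md"
    daily.parent.mkdir(exist_ok=True)
    with daily.open("a") as f:
        f.write(BUFFER.read_text())
    BUFFER.write_text("")  # clear only after the copy succeeds
```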
Agents make mistakes. Task data can get corrupted over time. The health-check command detects and auto-fixes common issues:
python3 scripts/task_manager.py health-check
Detects 5 categories of issues, including `completed_at` anomalies.

Auto-fixes the 4 safe categories (time anomalies are only flagged for human review).
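One auto-fixable check can be sketched like this: completed tasks missing a `completed_at` timestamp get one backfilled, while suspicious time values are only flagged. The two categories shown are illustrative, not the skill's actual five.

```python
# Sketch of a detect-and-repair health check. Categories here are
# illustrative assumptions, not the real health-check implementation.
from datetime import datetime, timezone

def health_check(tasks):
    fixed, flagged = [], []
    for t in tasks:
        # Safe to auto-fix: completed task with no completion timestamp.
        if t["status"] == "completed" and not t.get("completed_at"):
            t["completed_at"] = datetime.now(timezone.utc).isoformat()
            fixed.append((t["id"], "missing completed_at"))
        # Not safe to auto-fix: flag for human review only.
        if t.get("actual_minutes", 0) < 0:
            flagged.append((t["id"], "negative time log"))
    return fixed, flagged
```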
When to run:
These four patterns work together to create a robust system:
User request → WAL log → Update data → Update SESSION-STATE → Append to buffer
↓ ↓ ↓ ↓ ↓
Context cut? → Read WAL → Verify data → Check SESSION-STATE → Review buffer
Result: You never lose work, even during context truncation. The system self-heals and maintains consistency autonomously.
Trigger: Session starts with <summary> tag, or you're asked "where were we?" or "continue".
The Problem: Context was truncated. You don't remember what task you were working on.
Recovery Steps (in order):
1. Read `working-buffer.md` (raw danger-zone exchanges): `cat working-buffer.md`
2. Read `SESSION-STATE.md` (active task state): `cat SESSION-STATE.md`
3. Read today's WAL log: `cat memory/WAL-$(date +%Y-%m-%d).log | tail -20`
4. Check task data for the task ID from SESSION-STATE: `python3 scripts/task_manager.py list-tasks "Goal Title"`
5. Extract & update: pull important context from the buffer into SESSION-STATE if needed.
6. Present recovery: "Recovered from compaction. Last task: [title]. Progress: [%]. Next action: [what to do]. Continue?"
Do NOT ask "what were we discussing?" - The buffer and SESSION-STATE literally have the answer.
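The recovery order can be sketched as a small helper that reads each layer that exists, newest-signal first. File names follow the skill's conventions; the summary string is an assumption.

```python
# Sketch of the recovery order: buffer, then session state, then
# today's WAL. Summary format is an assumption.
from datetime import datetime, timezone
from pathlib import Path

def recover():
    sources = [
        Path("working-buffer.md"),     # 1. raw danger-zone exchanges
        Path("SESSION-STATE.md"),      # 2. active task state
        Path(f"memory/WAL-{datetime.now(timezone.utc):%Y-%m-%d}.log"),  # 3. today's WAL
    ]
    found = {p.name: p.read_text() for p in sources if p.exists()}
    if not found:
        return "No recovery files found - ask the human for context."
    return "Recovered from: " + ", ".join(found)
```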
The Law: "Code exists" ≠ "feature works." Never report task completion without end-to-end verification.
Trigger: About to mark a task completed or say "done":
Examples:
❌ Wrong: "Added health-check command. Task complete!" ✅ Right: "Added health-check. Testing... detected 4 issues, auto-fixed 3. Verified on broken test data. Task complete!"
❌ Wrong: "Implemented SESSION-STATE updates. Done!" ✅ Right: "Implemented SESSION-STATE. Tested with mark-progress, log-time, mark-blocked - all update correctly. Done!"
Why this matters: Agents often report completion based on "I wrote the code" rather than "I verified it works." VBR prevents false completions and builds trust.
The Core Question: Don't ask "what should I do?" Ask "what would genuinely help my human that they haven't thought to ask for?"
During heartbeats, you have the opportunity to make real progress:
The transformation: From waiting for prompts → making steady autonomous progress on shared goals.
DO message your human when:
DON'T spam with:
The goal: Be a proactive partner who makes things happen, not a chatty assistant who needs constant validation.
| State | Meaning |
|---|---|
| `pending` | Ready to work on (all dependencies met) |
| `in_progress` | Currently working on it |
| `blocked` | Can't proceed (dependencies not met) |
| `needs_input` | Waiting for human input/decision |
| `completed` | Done! |
| `cancelled` | No longer relevant |
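Guarding moves between these states keeps task data consistent. The allowed-transition map below is an assumption inferred from the state meanings, not taken from `task_manager.py`:

```python
# Sketch: validate state transitions against an assumed allowed-move map.
ALLOWED = {
    "pending": {"in_progress", "blocked", "cancelled"},
    "in_progress": {"completed", "blocked", "needs_input", "cancelled"},
    "blocked": {"pending", "cancelled"},
    "needs_input": {"in_progress", "cancelled"},
    "completed": set(),   # terminal
    "cancelled": set(),   # terminal
}

def transition(task, new_status):
    if new_status not in ALLOWED[task["status"]]:
        raise ValueError(f"{task['status']} -> {new_status} not allowed")
    task["status"] = new_status
```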
Proactive Tasks supports two distinct operational modes:
| Mode | Context | Trigger | Best For | Risk |
|---|---|---|---|---|
| Interactive (systemEvent) | Full main session context | User request, manual prompts | Decision-making, human-facing work | Full context available |
| Autonomous (isolated agentTurn) | No main session context | Heartbeat cron, scheduled background | Velocity reports, cleanup, recurring tasks | May lose context |
Don't use `systemEvent` for background work. When a cron job fires during your main session, the prompt gets queued and the work doesn't happen. Instead:
This ensures background tasks never interrupt your main conversation.
See HEARTBEAT-CONFIG.md for complete autonomous operation patterns, including:
To enable autonomous proactive work, you need to set up a heartbeat system. This tells you to periodically check for tasks and work on them.
Quick setup: See HEARTBEAT-CONFIG.md for complete setup instructions and patterns.
TL;DR:
Create a workspace `HEARTBEAT.md`. Your cron job should send this message every 30 minutes:
💓 Heartbeat check: Read HEARTBEAT.md if it exists (workspace context). Follow it strictly. Do not infer or repeat old tasks from prior chats. If nothing needs attention, reply HEARTBEAT_OK.
Add this to your workspace HEARTBEAT.md:
## Proactive Tasks (Every heartbeat) 🚀
Check if there's work to do on our goals:
- [ ] Run `python3 skills/proactive-tasks/scripts/task_manager.py next-task`
- [ ] If a task is returned, work on it for up to 10-15 minutes
- [ ] Update task status when done, blocked, or needs input
- [ ] Message your human with meaningful updates (completions, blockers, discoveries)
- [ ] Don't spam - only message for significant milestones or when stuck
**Goal:** Make autonomous progress on our shared objectives without waiting for prompts.
Every 30 minutes:
├─ Heartbeat fires
├─ You read HEARTBEAT.md
├─ Check for next task
├─ If task found → work on it, update status, message human if needed
└─ If nothing → reply "HEARTBEAT_OK" (silent)
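The cycle above can be sketched as plain control flow. `next_task`, `work_on`, and `message_human` are stand-ins for the real CLI calls and messaging channel, not actual APIs:

```python
# Sketch of one heartbeat cycle: find a task, work it, report only
# meaningful outcomes, stay silent otherwise. Callables are stand-ins.
def heartbeat(next_task, work_on, message_human):
    task = next_task()
    if task is None:
        return "HEARTBEAT_OK"          # nothing to do, stay silent
    outcome = work_on(task)            # up to ~10-15 minutes of work
    if outcome in ("completed", "blocked", "needs_input"):
        message_human(f"{task['title']}: {outcome}")
    return outcome
```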
The transformation: You go from reactive (waiting for prompts) to proactive (making steady autonomous progress).
Break goals into tasks that are:
✅ Do message when:
❌ Don't spam with:
If a task turns out to be bigger than expected:
All data is stored in `data/tasks.json`:
{
"goals": [
{
"id": "goal_001",
"title": "Build voice assistant hardware",
"priority": "high",
"context": "Replace Alexa with custom solution",
"created_at": "2026-02-05T05:25:00Z",
"status": "active"
}
],
"tasks": [
{
"id": "task_001",
"goal_id": "goal_001",
"title": "Research voice-to-text models",
"priority": "high",
"status": "completed",
"created_at": "2026-02-05T05:26:00Z",
"completed_at": "2026-02-05T06:15:00Z",
"notes": "Researched Whisper, Coqui, vosk. Whisper.cpp best for Pi."
}
]
}
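Loading this file with a referential-integrity check (every task points at an existing goal) can be sketched like this. The schema follows the example above; the validation itself is an assumption, not a documented feature:

```python
# Sketch: parse tasks.json and reject tasks that reference missing goals.
import json

def load_tasks(text):
    data = json.loads(text)
    goal_ids = {g["id"] for g in data["goals"]}
    orphans = [t["id"] for t in data["tasks"] if t["goal_id"] not in goal_ids]
    if orphans:
        raise ValueError(f"tasks reference missing goals: {orphans}")
    return data
```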
See CLI_REFERENCE.md for complete command documentation.
Before proposing new features, evaluate them using our VFM/ADL scoring frameworks to ensure stability and value:
Score across four dimensions:
Threshold: Must score ≥60 points to proceed.
Priority ordering: Stability > Explainability > Reusability > Scalability > Novelty
Forbidden Evolution:
The Golden Rule: "Does this let future-me solve more problems with less cost?" If no, skip it.
Day 1:
Human: "Let's build a custom voice assistant to replace Alexa"
Agent: *Creates goal, breaks into initial research tasks*
During heartbeat:
$ python3 scripts/task_manager.py next-task
→ task_001: Research voice-to-text models (priority: high)
# Agent works on it, completes research
$ python3 scripts/task_manager.py complete-task task_001 --notes "..."
Agent messages human:
"Hey! I finished researching voice models. Whisper.cpp looks perfect for Raspberry Pi - runs locally, good accuracy, low latency. Want me to compare hardware options next?"
Day 2:
Human: "Yeah, compare Pi 5 vs alternatives"
Agent: *Adds task, works on it during next heartbeat*
This cycle continues - the agent makes steady autonomous progress while keeping the human in the loop for decisions and updates.
Built by Toki for proactive AI partnership 🚀
**Weekly Installs:** 95
**GitHub Stars:** 1
**First Seen:** Feb 13, 2026
**Security Audits:** Gen Agent Trust Hub: Fail · Socket: Pass · Snyk: Pass
**Installed on:** gemini-cli (89), opencode (89), openclaw (88), cursor (88), github-copilot (87), codex (87)