brainstorming-research-ideas by orchestra-research/ai-research-skills
npx skills add https://github.com/orchestra-research/ai-research-skills --skill brainstorming-research-ideas
Structured frameworks for discovering the next research idea. This skill provides ten complementary ideation lenses that help researchers move from vague curiosity to concrete, defensible research proposals. Each framework targets a different cognitive mode—use them individually or combine them for comprehensive exploration.
Do NOT use this skill when the task is a literature review (use scientific-skills:literature-review instead).
Research ideas originate from two distinct modes. Knowing which mode you are in prevents a common failure: building solutions that lack real problems, or chasing problems without feasible approaches.
Problem-First (pain point → method):
Solution-First (new capability → application):
Workflow:
Self-Check:
Every research problem sits at a particular level of abstraction. Deliberately moving up or down the ladder reveals ideas invisible at your current level.
| Direction | Action | Outcome |
|---|---|---|
| Move Up (generalize) | Turn a specific result into a broader principle | Framework papers, theoretical contributions |
| Move Down (instantiate) | Test a general paradigm under concrete constraints | Empirical papers, surprising failure analyses |
| Move Sideways (analogize) | Apply same abstraction level to adjacent domain | Cross-pollination, transfer papers |
Workflow:
Example:
Breakthroughs often come from resolving tensions between widely accepted but seemingly conflicting goals. These contradictions are not bugs—they are the research opportunity.
Common Research Tensions:
| Tension Pair | Research Opportunity |
|---|---|
| Performance ↔ Efficiency | Can we match SOTA with 10x less compute? |
| Privacy ↔ Utility | Can federated/encrypted methods close the accuracy gap? |
| Generality ↔ Specialization | When does fine-tuning beat prompting, and why? |
| Safety ↔ Capability | Can alignment improve rather than tax capability? |
| Interpretability ↔ Performance | Do mechanistic insights enable better architectures? |
| Scale ↔ Accessibility | Can small models replicate emergent behaviors? |
Workflow:
Self-Check:
Borrowing structural ideas from other disciplines is one of the most generative research heuristics. Many foundational techniques emerged this way—attention mechanisms draw from cognitive science, genetic algorithms from biology, adversarial training from game theory.
Requirements for a Valid Analogy:
High-Yield Source Fields for ML Research:
| Source Field | Transferable Concepts |
|---|---|
| Neuroscience | Attention, memory consolidation, hierarchical processing |
| Physics | Energy-based models, phase transitions, renormalization |
| Economics | Mechanism design, auction theory, incentive alignment |
| Ecology | Population dynamics, niche competition, co-evolution |
| Linguistics | Compositionality, pragmatics, grammatical induction |
| Control Theory | Feedback loops, stability, adaptive regulation |
Workflow:
Strong ideas often come from revisiting old problems under new conditions. Advances in hardware, scale, data availability, or regulations can invalidate prior assumptions and make previously impractical approaches viable.
Categories of Change to Monitor:
| Change Type | Example | Research Implication |
|---|---|---|
| Compute | GPUs 10x faster | Methods dismissed as too expensive become feasible |
| Scale | Trillion-token datasets | Statistical arguments that failed at small scale may now hold |
| Regulation | EU AI Act, GDPR | Creates demand for compliant alternatives |
| Tooling | New frameworks, APIs | Reduces implementation barrier for complex methods |
| Failure | High-profile system failures | Exposes gaps in existing approaches |
| Cultural | New user behaviors | Shifts what problems matter most |
Workflow:
Understanding where a method breaks is often as valuable as showing where it works. Boundary probing systematically exposes the conditions under which accepted techniques fail.
Types of Boundaries to Probe:
Workflow:
Self-Check:
Before accepting complexity, ask whether a simpler approach suffices. Fields sometimes over-index on elaborate solutions when a streamlined baseline performs competitively.
Warning Signs of Unnecessary Complexity:
Workflow:
Contribution Framing:
Viewing a system from multiple perspectives reveals distinct classes of research questions. Each stakeholder sees different friction, risk, and opportunity.
Stakeholder Perspectives:
| Stakeholder | Key Questions |
|---|---|
| End User | Is this usable? What errors are unacceptable? What is the latency tolerance? |
| Developer | Is this debuggable? What is the maintenance burden? How does it compose? |
| Theorist | Why does this work? What are the formal guarantees? Where are the gaps? |
| Adversary | How can this be exploited? What are the attack surfaces? |
| Ethicist | Who is harmed? What biases are embedded? Who is excluded? |
| Regulator | Is this auditable? Can decisions be explained? Is there accountability? |
| Operator | What is the cost? How does it scale? What is the failure mode? |
Workflow:
Novelty often emerges from recombination or modularization. Innovation frequently lies not in new primitives, but in how components are arranged or separated.
Composition (combining existing techniques):
Decomposition (breaking apart monolithic systems):
Workflow:
A strong research idea should be defensible in two sentences to a smart non-expert. This test enforces clarity of purpose and sharpens the value proposition.
The Two-Sentence Template:
Sentence 1 (Problem): "[Domain] currently struggles with [specific problem], which matters because [concrete consequence]."
Sentence 2 (Insight): "We [approach] by [key mechanism], which works because [reason]."
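The template can be sketched as a pair of format strings, so every slot must be filled explicitly before a pitch exists at all. This is an illustrative sketch; the function name and all example values are invented, not part of the skill:

```python
# Illustrative sketch of the two-sentence template; the function and the
# example values below are hypothetical, not part of the skill itself.

PROBLEM = ("{domain} currently struggles with {problem}, "
           "which matters because {consequence}.")
INSIGHT = ("We {approach} by {mechanism}, "
           "which works because {reason}.")

def two_sentence_pitch(domain, problem, consequence, approach, mechanism, reason):
    """Return the pitch; every slot is a required argument."""
    return " ".join([
        PROBLEM.format(domain=domain, problem=problem, consequence=consequence),
        INSIGHT.format(approach=approach, mechanism=mechanism, reason=reason),
    ])

pitch = two_sentence_pitch(
    domain="Long-context language models",
    problem="retrieving facts buried mid-context",
    consequence="answers silently degrade on real documents",
    approach="reorder retrieved passages",
    mechanism="placing high-relevance spans near the prompt edges",
    reason="attention mass concentrates at sequence boundaries",
)
print(pitch)
```

If an argument is missing, the call fails immediately, which is exactly the point of the test: a slot you cannot fill means the idea is not yet clear.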
If You Cannot Fill This Template:
Calibration Questions:
Use this end-to-end workflow to go from blank page to ranked research ideas.
Goal: Produce 10-20 candidate ideas without filtering.
Goal: Narrow to 3-5 strongest ideas.
Apply these filters to each candidate:
| Filter | Question | Kill Criterion |
|---|---|---|
| Explain-It Test (F10) | Can I state this in two sentences? | If no → idea is not yet clear |
| Problem-First Check (F1) | Is the problem genuine and important? | If no one suffers from this → drop it |
| Simplicity Test (F7) | Is the complexity justified? | If a simpler approach works → simplify or drop |
| Stakeholder Check (F8) | Who benefits? Who might object? | If no clear beneficiary → drop it |
| Feasibility | Can I execute this with available resources? | If clearly infeasible → park it for later |
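The filter pass above can be sketched as a single loop that drops a candidate at the first kill criterion it trips, with feasibility parking rather than discarding. All field names here are hypothetical, chosen only for illustration:

```python
# Illustrative sketch of the Converge filters; every field name is invented.

def converge(candidates):
    """Apply the five filters in order; return (kept, parked) idea lists."""
    kept, parked = [], []
    for idea in candidates:
        if not idea.get("two_sentence_pitch"):    # Explain-It Test (F10)
            continue                              # not yet clear -> drop
        if not idea.get("genuine_problem"):       # Problem-First Check (F1)
            continue                              # no one suffers -> drop
        if idea.get("simpler_baseline_suffices"): # Simplicity Test (F7)
            continue                              # simplify or drop
        if not idea.get("clear_beneficiary"):     # Stakeholder Check (F8)
            continue                              # no beneficiary -> drop
        if not idea.get("feasible_now"):          # Feasibility
            parked.append(idea)                   # park it for later
            continue
        kept.append(idea)
    return kept, parked

ideas = [
    {"name": "A", "two_sentence_pitch": "...", "genuine_problem": True,
     "simpler_baseline_suffices": False, "clear_beneficiary": True,
     "feasible_now": True},
    {"name": "B", "two_sentence_pitch": "...", "genuine_problem": True,
     "simpler_baseline_suffices": False, "clear_beneficiary": True,
     "feasible_now": False},
    {"name": "C", "two_sentence_pitch": "", "genuine_problem": True,
     "simpler_baseline_suffices": False, "clear_beneficiary": True,
     "feasible_now": True},
]
kept, parked = converge(ideas)
print([i["name"] for i in kept], [i["name"] for i in parked])
```

The ordering matters: the cheap clarity check runs first, so an idea that cannot be stated in two sentences never consumes the more expensive checks.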
Goal: Turn the top idea into a concrete research plan.
Completion Checklist:
Not sure which framework to start with? Use this decision guide:
| Your Situation | Start With |
|---|---|
| "I don't know what area to work in" | Tension Hunting (F3) → What Changed (F5) |
| "I have a vague area but no specific idea" | Abstraction Ladder (F2) → Failure Analysis (F6) |
| "I have an idea but I'm not sure it's good" | Explain-It Test (F10) → Simplicity Test (F7) |
| "I have a good idea but need a fresh angle" | Cross-Pollination (F4) → Stakeholder Rotation (F8) |
| "I want to combine existing work into something new" | Composition/Decomposition (F9) |
| "I found a cool technique and want to apply it" | Problem-First Check (F1) → Stakeholder Rotation (F8) |
| "I want to challenge conventional wisdom" | Failure Analysis (F6) → Simplicity Test (F7) |
| Pitfall | Symptom | Fix |
|---|---|---|
| Novelty without impact | "No one has done X" but no one needs X | Apply Problem-First Check (F1) |
| Incremental by default | Idea is +2% on a benchmark | Climb the Abstraction Ladder (F2) |
| Complexity worship | Method has 8 components, each helping marginally | Apply Simplicity Test (F7) |
| Echo chamber | All ideas come from reading the same 10 papers | Use Cross-Pollination (F4) |
| Stale assumptions | "This was tried and didn't work" (5 years ago) | Apply What Changed (F5) |
| Single-perspective bias | Only considering the ML engineer's view | Use Stakeholder Rotation (F8) |
| Premature convergence | Committed to first idea without exploring alternatives | Run full Diverge phase |
When a researcher asks for help brainstorming research ideas:
Key Principles :
Weekly Installs: 77
Repository
GitHub Stars: 5.6K
First Seen: Feb 23, 2026
Security Audits
Gen Agent Trust Hub: Pass | Socket: Pass | Snyk: Pass
Installed on
opencode: 72
gemini-cli: 70
github-copilot: 70
amp: 70
codex: 70
kimi-cli: 70