Knowledge Base Manager by daffy0208/ai-dev-standards
npx skills add https://github.com/daffy0208/ai-dev-standards --skill 'Knowledge Base Manager'为人工智能系统和人类使用构建和维护高质量的知识库。
知识库 = 结构化信息 + 质量策展 + 可访问性
知识库不仅仅是数据转储——它是经过策展、验证、版本化的信息,旨在回答问题并支持推理。
定义: 文档集合,经过分块和嵌入以进行语义搜索
最适合:
优势:
劣势:
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
使用: rag-implementer 技能 + vector-database-mcp
定义: 通过关系连接的实体(人、地点、事物)网络
最适合:
优势:
劣势:
使用: knowledge-graph-builder 技能 + graph-database-mcp
定义: 用于非结构化知识的文档 + 用于结构化实体/关系的图谱
最适合:
优势:
劣势:
使用: 同时使用 rag-implementer + knowledge-graph-builder 技能
What kind of knowledge do you have?
├─ Mostly unstructured text (docs, articles, content)?
│ └─ Document-Based KB (RAG)
│ Use: rag-implementer skill
│
├─ Mostly structured entities with relationships?
│ └─ Entity-Based KB (Graph)
│ Use: knowledge-graph-builder skill
│
└─ Mix of both?
└─ Hybrid KB (RAG + Graph)
Use: Both skills + This skill for integration
目标: 了解现有知识及其结构方式
行动:
清点现有知识来源
对知识类型进行分类
选择知识库架构
定义知识模式
验证:
目标: 将原始信息转化为高质量知识
行动:
从来源中提取知识
清理和规范化
结构化知识
添加来源信息
策展最佳实践:
验证:
目标: 实现知识访问的技术基础设施
架构模式:
对于基于文档的知识库:
// Vector database for semantic search
interface DocumentKB {
store: 'Pinecone' | 'Weaviate' | 'pgvector'
chunks: {
content: string
embedding: number[]
metadata: {
source: string
title: string
updated_at: string
category: string
}
}[]
}
对于基于实体的知识库:
// Graph database for relationship queries
interface EntityKB {
store: 'Neo4j' | 'ArangoDB'
nodes: {
id: string
type: 'Person' | 'Organization' | 'Product' | 'Concept'
properties: Record<string, any>
}[]
relationships: {
from: string
to: string
type: string
properties: Record<string, any>
}[]
}
对于混合知识库:
// Both vector DB + graph DB
interface HybridKB {
vectorDB: DocumentKB
graphDB: EntityKB
linker: {
// Links documents to entities mentioned in them
linkDocumentToEntities(docId: string): string[]
// Links entities to documents that mention them
linkEntityToDocuments(entityId: string): string[]
}
}
行动:
选择数据库
实现搜索/查询层
添加缓存和优化
验证:
目标: 确保知识库的准确性和可靠性
质量指标:
验证策略:
1. 测试问题集 创建 100+ 个已知正确答案的测试问题:
interface TestQuestion {
question: string
expected_answer: string
category: string
difficulty: 'easy' | 'medium' | 'hard'
}
2. 人工审查
3. 自动化检查
4. 持续监控
interface KBHealthMetrics {
accuracy_score: number // 0-100
coverage_score: number // % questions answered
freshness_score: number // avg days since update
consistency_score: number // % no conflicts
user_satisfaction: number // feedback rating
}
行动:
验证:
目标: 追踪知识随时间的变化并支持回滚
版本控制的重要性:
版本控制策略:
1. 快照版本控制
interface KnowledgeEntry {
id: string
content: string
version: number
created_at: string
updated_at: string
updated_by: string
changelog: string
previous_version?: string // ID of prior version
}
2. 事件溯源
interface KnowledgeEvent {
event_id: string
entity_id: string
event_type: 'created' | 'updated' | 'deleted'
timestamp: string
changes: {
field: string
old_value: any
new_value: any
}[]
author: string
}
3. Git 风格版本控制
行动:
验证:
目标: 长期保持知识库健康
维护任务:
每日:
每周:
每月:
每季度:
治理框架:
1. 角色与职责
2. 变更流程
Submit → Review → Approve → Publish → Monitor
3. 质量标准
行动:
验证:
问题: 摄取所有内容而不进行质量过滤
影响: 信噪比低,搜索结果差,用户沮丧
解决方案: 在摄取前进行策展。质量 > 数量
问题: 知识变化但未追踪历史
影响: 无法审计变更,无法回滚错误,没有问责制
解决方案: 从阶段 5 开始实施版本控制
问题: 知识库过时但无人知晓
影响: AI 系统使用旧事实产生幻觉,用户得到错误答案
解决方案: 新鲜度监控 + 计划更新
问题: 相同事实出现在多个地方,变得不一致
影响: 答案冲突,用户困惑
解决方案: 去重 + 单一事实来源
问题: 知识没有来源引用
影响: 无法验证准确性,无法追踪错误
解决方案: 始终追踪来源 + 时间戳 + 作者
问题: 没有用户验证,知识库不符合需求
解决方案: 从用户研究开始,持续验证
问题: 等待知识库"完美"才发布
解决方案: 以 80% 覆盖率发布,根据使用情况迭代
问题: 构建复杂的混合系统,而简单的文档就能满足需求
解决方案: 从简单开始,仅在需要时增加复杂性
问题: 一次性构建,永不更新
解决方案: 从第一天起就建立维护计划
开始前:
rag-implementerknowledge-graph-builder阶段 1 - 架构设计(第 1 周):
阶段 2 - 初始构建(第 2-3 周):
阶段 3 - 迭代(持续进行):
rag-implementer, knowledge-graph-builder, data-engineer, quality-auditorvector-database-mcp, graph-database-mcp, knowledge-base-mcp, semantic-search-mcpSTANDARDS/architecture-patterns/rag-pattern.md, knowledge-base-pattern.md (即将推出)INTEGRATIONS/pinecone/, INTEGRATIONS/graph-databases/neo4j/请记住: 知识库的质量取决于其策展水平。从第一天起就投资于质量,建立维护流程,并根据用户反馈进行迭代。目标不是拥有所有知识——而是拥有正确的知识,组织良好,易于访问。
每周安装数
0
代码仓库
GitHub 星标数
18
首次出现
Jan 1, 1970
安全审计
Build and maintain high-quality knowledge bases for AI systems and human consumption.
Knowledge Base = Structured Information + Quality Curation + Accessibility
A knowledge base is not just a data dump—it's curated, validated, versioned information designed to answer questions and enable reasoning.
What it is: Collection of documents, chunked and embedded for semantic search
Best for:
Strengths:
Weaknesses:
Use: rag-implementer skill + vector-database-mcp
What it is: Network of entities (people, places, things) connected by relationships
Best for:
Strengths:
Weaknesses:
Use: knowledge-graph-builder skill + graph-database-mcp
What it is: Documents for unstructured knowledge + Graph for structured entities/relationships
Best for:
Strengths:
Weaknesses:
Use: Both rag-implementer + knowledge-graph-builder skills
What kind of knowledge do you have?
├─ Mostly unstructured text (docs, articles, content)?
│ └─ Document-Based KB (RAG)
│ Use: rag-implementer skill
│
├─ Mostly structured entities with relationships?
│ └─ Entity-Based KB (Graph)
│ Use: knowledge-graph-builder skill
│
└─ Mix of both?
└─ Hybrid KB (RAG + Graph)
Use: Both skills + This skill for integration
Goal : Understand what knowledge exists and how to structure it
Actions :
Inventory existing knowledge sources
Classify knowledge types
Choose KB architecture
Define knowledge schema
Validation :
Goal : Transform raw information into high-quality knowledge
Actions :
Extract knowledge from sources
Clean and normalize
Structure knowledge
Add provenance
Curation Best Practices :
Validation :
Goal : Implement technical infrastructure for knowledge access
Architecture Patterns :
For Document-Based KB:
// Vector database for semantic search
interface DocumentKB {
store: 'Pinecone' | 'Weaviate' | 'pgvector'
chunks: {
content: string
embedding: number[]
metadata: {
source: string
title: string
updated_at: string
category: string
}
}[]
}
For Entity-Based KB:
// Graph database for relationship queries
interface EntityKB {
store: 'Neo4j' | 'ArangoDB'
nodes: {
id: string
type: 'Person' | 'Organization' | 'Product' | 'Concept'
properties: Record<string, any>
}[]
relationships: {
from: string
to: string
type: string
properties: Record<string, any>
}[]
}
For Hybrid KB:
// Both vector DB + graph DB
interface HybridKB {
vectorDB: DocumentKB
graphDB: EntityKB
linker: {
// Links documents to entities mentioned in them
linkDocumentToEntities(docId: string): string[]
// Links entities to documents that mention them
linkEntityToDocuments(entityId: string): string[]
}
}
Actions :
Choose database(s)
Implement search/query layer
Add caching and optimization
Validation :
Goal : Ensure knowledge base accuracy and reliability
Quality Metrics :
Validation Strategies :
1. Test Question Sets Create 100+ test questions with known correct answers:
interface TestQuestion {
question: string
expected_answer: string
category: string
difficulty: 'easy' | 'medium' | 'hard'
}
2. Human Review
3. Automated Checks
4. Continuous Monitoring
interface KBHealthMetrics {
accuracy_score: number // 0-100
coverage_score: number // % questions answered
freshness_score: number // avg days since update
consistency_score: number // % no conflicts
user_satisfaction: number // feedback rating
}
Actions :
Validation :
Goal : Track knowledge changes over time and enable rollback
Why Versioning Matters :
Versioning Strategies :
1. Snapshot Versioning
interface KnowledgeEntry {
id: string
content: string
version: number
created_at: string
updated_at: string
updated_by: string
changelog: string
previous_version?: string // ID of prior version
}
2. Event Sourcing
interface KnowledgeEvent {
event_id: string
entity_id: string
event_type: 'created' | 'updated' | 'deleted'
timestamp: string
changes: {
field: string
old_value: any
new_value: any
}[]
author: string
}
3. Git-Style Versioning
Actions :
Validation :
Goal : Keep knowledge base healthy long-term
Maintenance Tasks :
Daily:
Weekly:
Monthly:
Quarterly:
Governance Framework :
1. Roles & Responsibilities
2. Change Process
Submit → Review → Approve → Publish → Monitor
3. Quality Standards
Actions :
Validation :
Problem : Ingesting everything without quality filtering
Impact : Low signal-to-noise ratio, poor search results, user frustration
Solution : Curate before ingesting. Quality > Quantity
Problem : Knowledge changes but no history tracked
Impact : Can't audit changes, can't rollback errors, no accountability
Solution : Implement versioning from Phase 5
Problem : Knowledge base outdated but no one knows
Impact : AI systems hallucinate using old facts, users get wrong answers
Solution : Freshness monitoring + scheduled updates
Problem : Same fact in multiple places, becomes inconsistent
Impact : Conflicting answers, confused users
Solution : Deduplication + single source of truth
Problem : Knowledge without source citations
Impact : Can't verify accuracy, can't trace errors
Solution : Always track source + timestamp + author
Problem : No user validation, KB doesn't meet needs
Solution : Start with user research, validate continuously
Problem : Waiting to launch until KB is "perfect"
Solution : Launch with 80% coverage, iterate based on usage
Problem : Building complex hybrid system when simple docs would work
Solution : Start simple, add complexity only when needed
Problem : Build once, never update
Solution : Establish maintenance schedule from day 1
Before you start:
rag-implementer if using document KBknowledge-graph-builder if using entity KBPhase 1 - Architecture (Week 1):
Phase 2 - Initial Build (Week 2-3):
Phase 3 - Iterate (Ongoing):
rag-implementer, knowledge-graph-builder, data-engineer, quality-auditorvector-database-mcp, graph-database-mcp, knowledge-base-mcp, semantic-search-mcpSTANDARDS/architecture-patterns/rag-pattern.md, knowledge-base-pattern.md (coming soon)Remember : A knowledge base is only as good as its curation. Invest in quality from day 1, establish maintenance processes, and iterate based on user feedback. The goal is not to have all knowledge—it's to have the right knowledge, well-organized, and easily accessible.
Weekly Installs
0
Repository
GitHub Stars
18
First Seen
Jan 1, 1970
Security Audits
超能力技能使用指南:AI助手技能调用优先级与工作流程详解
45,100 周安装
前端动画设计指南:提升用户体验的微交互与动效策略
38,600 周安装
跨平台设计适配指南:移动端、桌面端、平板、打印及邮件适配策略与实施方法
38,800 周安装
前端打磨(Polish)终极指南:提升产品细节与用户体验的系统化检查清单
39,900 周安装
Web应用测试指南:使用Python Playwright自动化测试本地Web应用
39,500 周安装
Azure Cloud Migrate:AWS Lambda到Azure Functions迁移工具 - 微软官方评估与代码迁移
38,700 周安装
Excel财务建模规范与xlsx文件处理指南:专业格式、零错误公式与数据分析
42,900 周安装
INTEGRATIONS/pinecone/, INTEGRATIONS/graph-databases/neo4j/