⚠️

重要前提

安装AI Skills的关键前提是：必须科学上网，且开启TUN模式，这一点至关重要，直接决定安装能否顺利完成，在此郑重提醒三遍：科学上网，科学上网，科学上网。查看完整安装教程 →

数据模式与知识建模指南：数据库设计、知识图谱与API数据模型

data-schema-knowledge-modeling by lyndonkl/claude

45 周安装量

45 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/lyndonkl/claude --skill data-schema-knowledge-modeling

方法论数据库系统架构

🇨🇳中文介绍

数据模式与知识建模

目的

创建严谨、经过验证的实体、关系和约束模型，以实现正确的系统实现、知识表示和语义推理。

使用时机

在以下场景中调用此技能：

为新应用程序设计数据库模式（SQL、NoSQL、图数据库）
对具有许多实体和关系的复杂领域进行建模
为语义搜索/推理构建知识图谱或本体
定义 API 数据模型和契约
创建分类法或分类层次结构
建立数据治理和规范模型
将遗留模式迁移到现代架构
解决领域概念和关系中的歧义
实现跨系统的数据集成
记录系统不变式和业务规则

常见触发短语：

"为...设计一个模式"
"对实体和关系进行建模"
"创建一个知识图谱"
"数据模型是什么？"
"定义本体"
"我们应该如何构建这些数据？"
"映射...之间的关系"
"设计 API 数据模型"

定义

数据模式与知识建模 是正式定义以下内容的过程：

实体 - 存在的事物（用户、产品、订单、组织）
属性 - 实体的属性（名称、价格、状态、创建时间）
关系 - 实体之间的连接（用户拥有订单，产品属于类别）
- 规则和不变式（邮箱唯一、价格 > 0、一个主要地址）

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

相关 Skills

暂无相关 Skills

基数 - 每个实体的数量关系（一对一、一对多、多对多）

快速示例： 电子商务模式：

实体：用户、产品、订单、购物车、支付
关系：用户拥有多个订单，订单包含多个产品（通过订单项），用户有一个购物车
约束：邮箱必须唯一，订单总额与订单项总和匹配，支付金额等于订单总额
结果：明确的模型，可防止数据不一致

复制此清单并跟踪进度：

数据模式与知识建模进度：
- [ ] 步骤 1：收集领域需求和范围
- [ ] 步骤 2：识别实体和属性
- [ ] 步骤 3：定义关系和基数
- [ ] 步骤 4：指定约束和不变式
- [ ] 步骤 5：验证并记录模型

步骤 1：收集领域需求和范围

向用户询问领域描述、核心用例（此模式将支持哪些查询/操作）、现有数据（如果是迁移/集成）、性能/规模要求以及技术约束（SQL vs NoSQL vs 图数据库）。理解用例会塑造模型——OLTP、OLAP 和图遍历需要不同的设计。请参阅模式类型以获取指导。

步骤 2：识别实体和属性

从需求中提取名词（这些是候选实体）。对于每个实体，列出属性及其类型和可空性。使用 resources/template.md 进行系统性的实体识别。验证每个实体是否代表一个具有独立生命周期的不同概念。记录实体的目的和示例。

步骤 3：定义关系和基数

映射实体之间的连接（一对一、一对多、多对多）。对于多对多关系，识别连接表/实体。指定关系的方向性和可选性（X 可以在没有 Y 的情况下存在吗？）。使用 resources/methodology.md 处理复杂的关系模式，如层次结构、多态关联和时间关系。

步骤 4：指定约束和不变式

定义唯一性约束、外键关系、检查约束和业务规则。记录领域不变式（必须始终为真的规则）。识别派生/计算属性与存储属性。使用 resources/methodology.md 处理高级约束模式和验证策略。

步骤 5：验证并记录模型

创建包含完整模式定义的 data-schema-knowledge-modeling.md 文件。根据用例进行验证——该模式能否支持所需的查询/操作？检查规范化（消除冗余）或反规范化（针对特定查询进行优化）。使用 resources/evaluators/rubric_data_schema_knowledge_modeling.json 进行自我评估。最低标准：平均分 ≥ 3.5。

根据用例和技术选择：

关系型（SQL）模式

最适合： 事务性系统（OLTP）、强一致性、需要连接操作的复杂查询
模式： 规范化的表、外键、ACID 事务
示例用例： 电子商务订单、银行交易、人力资源系统
关键决策： 规范化级别（3NF 用于一致性 vs 反规范化用于读取性能）

文档/NoSQL 模式

最适合： 灵活/演进的结构、高写入吞吐量、反规范化的读取
模式： 嵌套文档、嵌入式关系、无连接操作
示例用例： 内容管理、用户档案、事件日志
关键决策： 嵌入 vs 引用（一对少用嵌入，一对多用引用）

图模式（本体）

最适合： 复杂关系、遍历查询、语义推理、知识图谱
模式： 节点（实体）、边（关系）、两者上的属性
示例用例： 社交网络、欺诈检测、推荐引擎、科学研究
关键决策： 属性图 vs RDF 三元组

事件/时间序列模式

最适合： 审计日志、指标、物联网数据、仅追加数据
模式： 不可变事件、基于时间的分区、聚合表
示例用例： 用户活动跟踪、监控、金融交易
关键决策： 原始事件 vs 预聚合摘要

维度（数据仓库）模式

最适合： 分析（OLAP）、聚合、历史报告
模式： 事实表 + 维度表（星型/雪花型模式）
示例用例： 商业智能、销售分析、客户 360 度视图
关键决策： 星型模式（反规范化） vs 雪花型模式（规范化的维度）

模式：实体生命周期建模 明确跟踪实体状态变化。示例：订单（草稿 → 待处理 → 已确认 → 已发货 → 已送达 → 已完成/已取消）。包含状态字段、每个状态的时间戳，如果需要历史记录，则包含转换表。

模式：软删除 从不物理删除记录——添加 deletedAt 时间戳。允许数据恢复、审计合规性和引用完整性。在查询中使用 WHERE deletedAt IS NULL 进行过滤。

模式：多态关联 实体与多种类型相关联。示例：评论可以针对帖子或照片。选项：(1) 单独的外键（commentableType + commentableId），(2) 每种类型的连接表，(3) 单表继承。

模式：时间/历史数据 跟踪随时间的变化。选项：(1) 每条记录的有效期/到期日期，(2) 单独的历史表，(3) 事件溯源（将所有更改存储为事件）。根据查询模式选择。

模式：多租户 按客户隔离数据。选项：(1) 单独的数据库（强隔离），(2) 共享模式，使用 tenantId 列（高效），(3) 同一数据库中的单独模式（平衡）。如果共享，在所有查询中添加 tenantId。

模式：层次结构 对树形/嵌套结构进行建模。选项：(1) 邻接表（parentId），(2) 嵌套集（左/右值），(3) 路径枚举（物化路径），(4) 闭包表（所有祖先-后代对）。在读取/写入性能之间权衡。

✓ 应该做：

从用例开始——模式服务于查询/操作
先规范化，然后针对特定性能需求进行反规范化
明确记录所有约束和不变式
使用有意义、一致的命名约定
考虑未来的演进——为可扩展性而设计
根据所有必需的用例验证模型
准确地对现实世界进行建模（不要强行适应技术）

✗ 不要做：

脱离用例孤立地设计模式
过早优化（在测量之前进行反规范化）
跳过约束定义（导致数据损坏）
使用通用名称（数据、值、事物）——要具体
忽略基数和可空性
在领域实体中建模实现细节
忘记从现有系统进行数据迁移的路径
在实体之间创建循环依赖

resources/template.md - 用于实体识别、关系映射和约束定义的结构化流程
resources/methodology.md - 高级模式：时间建模、图本体、模式演进、规范化策略
resources/examples/ - 包含验证的完整模式设计示例
resources/evaluators/rubric_data_schema_knowledge_modeling.json - 交付前的质量评估

何时选择哪种资源：

简单领域（< 10 个实体） → 从模板开始
复杂领域或图/本体 → 研究方法论中的高级模式
需要查看示例 → 查看示例文件夹
交付给用户之前 → 始终使用评估标准进行验证

预期交付物： data-schema-knowledge-modeling.md 文件，包含：领域描述、包含属性和类型的完整实体定义、包含基数的关系映射、约束规范、图表（ERD/图可视化）、针对用例的验证以及实现说明。

常见模式表示法：

ERD（实体-关系图）：实体和关系的可视化表示
UML 类图：带有继承和关联的面向对象视图
图图：用于图数据库的节点和边
JSON 模式：带有验证规则的 API/文档结构
SQL DDL：可执行的 CREATE TABLE 语句
本体（OWL/RDF）：语义网知识表示

🇺🇸English

Data Schema & Knowledge Modeling

Purpose
When to Use
What Is It
Workflow
Schema Types
Common Patterns
Guardrails
Quick Reference

Purpose

Create rigorous, validated models of entities, relationships, and constraints that enable correct system implementation, knowledge representation, and semantic reasoning.

When to Use

Invoke this skill when you need to:

Design database schema (SQL, NoSQL, graph) for new application
Model complex domain with many entities and relationships
Build knowledge graph or ontology for semantic search/reasoning
Define API data models and contracts
Create taxonomies or classification hierarchies
Establish data governance and canonical models
Migrate legacy schemas to modern architectures
Resolve ambiguity in domain concepts and relationships
Enable data integration across systems
Document system invariants and business rules

Common trigger phrases:

"Design a schema for..."
"Model the entities and relationships"
"Create a knowledge graph"
"What's the data model?"
"Define the ontology"
"How should we structure this data?"
"Map relationships between..."
"Design the API data model"

What Is It

Data schema & knowledge modeling is the process of formally defining:

Entities - Things that exist (User, Product, Order, Organization)
Attributes - Properties of entities (name, price, status, createdAt)
Relationships - Connections between entities (User owns Order, Product belongsTo Category)
Constraints - Rules and invariants (unique email, price > 0, one primary address)
Cardinality - How many of each (one-to-many, many-to-many)

Quick example: E-commerce schema:

Entities : User, Product, Order, Cart, Payment
Relationships : User has many Orders, Order contains many Products (via OrderItems), User has one Cart
Constraints : Email must be unique, Order total matches sum of OrderItems, Payment amount equals Order total
Result : Unambiguous model that prevents data inconsistencies

Workflow

Copy this checklist and track your progress:

Data Schema & Knowledge Modeling Progress:
- [ ] Step 1: Gather domain requirements and scope
- [ ] Step 2: Identify entities and attributes
- [ ] Step 3: Define relationships and cardinality
- [ ] Step 4: Specify constraints and invariants
- [ ] Step 5: Validate and document the model

Step 1: Gather domain requirements and scope

Ask user for domain description, core use cases (what queries/operations will this support), existing data (if migration/integration), performance/scale requirements, and technology constraints (SQL vs NoSQL vs graph database). Understanding use cases shapes the model - OLTP vs OLAP vs graph traversal require different designs. See Schema Types for guidance.

Step 2: Identify entities and attributes

Extract nouns from requirements (those are candidate entities). For each entity, list attributes with types and nullability. Use resources/template.md for systematic entity identification. Verify each entity represents a distinct concept with independent lifecycle. Document entity purpose and examples.

Step 3: Define relationships and cardinality

Map connections between entities (one-to-one, one-to-many, many-to-many). For many-to-many, identify junction tables/entities. Specify relationship directionality and optionality (can X exist without Y?). Use resources/methodology.md for complex relationship patterns like hierarchies, polymorphic associations, and temporal relationships.

Step 4: Specify constraints and invariants

Define uniqueness constraints, foreign key relationships, check constraints, and business rules. Document domain invariants (rules that must ALWAYS be true). Identify derived/computed attributes vs stored. Use resources/methodology.md for advanced constraint patterns and validation strategies.

Step 5: Validate and document the model

Create data-schema-knowledge-modeling.md file with complete schema definition. Validate against use cases - can the schema support required queries/operations? Check for normalization (eliminate redundancy) or denormalization (optimize for specific queries). Self-assess using resources/evaluators/rubric_data_schema_knowledge_modeling.json. Minimum standard: Average score ≥ 3.5.

Schema Types

Choose based on use case and technology:

Relational (SQL) Schema

Best for: Transactional systems (OLTP), strong consistency, complex queries with joins
Pattern: Normalized tables, foreign keys, ACID transactions
Example use cases: E-commerce orders, banking transactions, HR systems
Key decision: Normalization level (3NF for consistency vs denormalized for read performance)

Document/NoSQL Schema

Best for: Flexible/evolving structure, high write throughput, denormalized reads
Pattern: Nested documents, embedded relationships, no joins
Example use cases: Content management, user profiles, event logs
Key decision: Embed vs reference (embed for 1-to-few, reference for 1-to-many)

Graph Schema (Ontology)

Best for: Complex relationships, traversal queries, semantic reasoning, knowledge graphs
Pattern: Nodes (entities), edges (relationships), properties on both
Example use cases: Social networks, fraud detection, recommendation engines, scientific research
Key decision: Property graph vs RDF triples

Event/Time-Series Schema

Best for: Audit logs, metrics, IoT data, append-only data
Pattern: Immutable events, time-based partitioning, aggregation tables
Example use cases: User activity tracking, monitoring, financial transactions
Key decision: Raw events vs pre-aggregated summaries

Dimensional (Data Warehouse) Schema

Best for: Analytics (OLAP), aggregations, historical reporting
Pattern: Fact tables + dimension tables (star/snowflake schema)
Example use cases: Business intelligence, sales analytics, customer 360
Key decision: Star schema (denormalized) vs snowflake (normalized dimensions)

Common Patterns

Pattern: Entity Lifecycle Modeling Track entity state changes explicitly. Example: Order (draft → pending → confirmed → shipped → delivered → completed/cancelled). Include status field, timestamps for each state, and transitions table if history needed.

Pattern: Soft Deletes Never physically delete records - add deletedAt timestamp. Allows data recovery, audit compliance, and referential integrity. Filter WHERE deletedAt IS NULL in queries.

Pattern: Polymorphic Associations Entity relates to multiple types. Example: Comment can be on Post or Photo. Options: (1) separate foreign keys (commentableType + commentableId), (2) junction tables per type, (3) single table inheritance.

Pattern: Temporal/Historical Data Track changes over time. Options: (1) Effective/expiry dates per record, (2) separate history table, (3) event sourcing (store all changes as events). Choose based on query patterns.

Pattern: Multi-tenancy Isolate data per customer. Options: (1) Separate databases (strong isolation), (2) Shared schema with tenantId column (efficient), (3) Separate schemas in same DB (balance). Add tenantId to all queries if shared.

Pattern: Hierarchies Model trees/nested structures. Options: (1) Adjacency list (parentId), (2) Nested sets (left/right values), (3) Path enumeration (materialized path), (4) Closure table (all ancestor-descendant pairs). Trade-offs between read/write performance.

Guardrails

✓ Do:

Start with use cases - schema serves queries/operations
Normalize first, then denormalize for specific performance needs
Document all constraints and invariants explicitly
Use meaningful, consistent naming conventions
Consider future evolution - design for extensibility
Validate model against ALL required use cases
Model the real world accurately (don't force fit to technology)

✗ Don't:

Design schema in isolation from use cases
Premature optimization (denormalize before measuring)
Skip constraint definitions (leads to data corruption)
Use generic names (data, value, thing) - be specific
Ignore cardinality and nullability
Model implementation details in domain entities
Forget about data migration path from existing systems
Create circular dependencies between entities

Quick Reference

Resources:

resources/template.md - Structured process for entity identification, relationship mapping, and constraint definition
resources/methodology.md - Advanced patterns: temporal modeling, graph ontologies, schema evolution, normalization strategies
resources/examples/ - Worked examples showing complete schema designs with validation
resources/evaluators/rubric_data_schema_knowledge_modeling.json - Quality assessment before delivery

When to choose which resource:

Simple domain (< 10 entities) → Start with template
Complex domain or graph/ontology → Study methodology for advanced patterns
Need to see examples → Review examples folder
Before delivering to user → Always validate with rubric

Expected deliverable: data-schema-knowledge-modeling.md file containing: domain description, complete entity definitions with attributes and types, relationship mappings with cardinality, constraint specifications, diagram (ERD/graph visualization), validation against use cases, and implementation notes.

Common schema notations:

ERD (Entity-Relationship Diagram): Visual representation of entities and relationships
UML Class Diagram : Object-oriented view with inheritance and associations
Graph Diagram : Nodes and edges for graph databases
JSON Schema : API/document structure with validation rules
SQL DDL : Executable CREATE TABLE statements
Ontology (OWL/RDF) : Semantic web knowledge representation

Weekly Installs

Repository

lyndonkl/claude

GitHub Stars

First Seen

Jan 24, 2026

Security Audits

Gen Agent Trust HubPass SocketPass SnykPass

Installed on

gemini-cli40

opencode40

cursor39

codex38

github-copilot37

claude-code36