create-custom-dagster-component by c00ldudenoonan/economic-data-project
npx skills add https://github.com/c00ldudenoonan/economic-data-project --skill create-custom-dagster-component
This skill automates the creation and validation of a new custom Dagster component using the dg CLI tool with uv as a package manager. It incorporates demo mode functionality for creating realistic demonstrations that can run locally without external dependencies. The documentation for creating good components can be found here https://docs.dagster.io/guides/build/components/creating-new-components/creating-and-registering-a-component and here https://github.com/dagster-io/dagster/blob/master/python_modules/libraries/dagster-dbt/dagster_dbt/components/dbt_project/component.py for a complex example of a component.
When invoked, this skill will:
- Scaffold a new Dagster component with `dg scaffold component ComponentName`
- Fill in the `build_defs()` function with both real and demo mode implementations
- Add a `demo_mode` boolean flag in the component YAML for toggling between real and local demo implementations
- Populate a component instance with the `dg scaffold defs my_module.components.ComponentName my_component` command
- Validate with `dg check defs` and `dg list defs` to ensure that the expected component instances are all loaded

Before running this skill, ensure:

- `uv` is installed (check with `uv --version`)

Ask the user for:
- A component name, e.g. `MyDagsterComponent`. Validate that it is a valid Python class name (PascalCase, no spaces).
- Whether they want a custom scaffolder for the component instances (used in a later step).
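Name validation can be a quick programmatic check. A stdlib-only sketch (the exact rules `dg` enforces are an assumption here; adjust as needed):

```python
import keyword


def is_valid_component_name(name: str) -> bool:
    """Check for a PascalCase Python identifier, e.g. MyDagsterComponent.

    Note: these rules are an assumption, not dg's documented validation.
    """
    return name.isidentifier() and name[:1].isupper() and not keyword.iskeyword(name)


print(is_valid_component_name("MyDagsterComponent"))  # True
print(is_valid_component_name("my component"))        # False
```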
Use dg to create the component
```shell
uv run dg scaffold component <ComponentName>
```
This will:
- Scaffold a new Dagster component in `defs/components`
- Create a `component_name.py` file

Fill in the `build_defs()` function in the component file. The component should:

- Include a `demo_mode` parameter in the component params (default: `False`)
- Configure any resources in a `resources.py` file in the `defs/` folder, using `dg scaffold defs dagster.resource resources.py`
- Provide an `assets` field in the YAML that describes what assets are used in the underlying component. See https://dagster.io/blog/dsls-to-the-rescue for best practices in how to design a good DSL. Refer to https://github.com/dagster-io/dagster/blob/master/python_modules/libraries/dagster-dbt/dagster_dbt/components/dbt_project/component.py and https://github.com/dagster-io/dagster/blob/master/python_modules/libraries/dagster-fivetran/dagster_fivetran/components/workspace_component/component.py for two reference architectures for good component design with multi-assets.
- Use the `kinds` argument to indicate the technologies in use

Example asset structure:
CRITICAL: When creating a custom component, consider what will consume your component's assets. The asset keys you generate should align with downstream component expectations to avoid requiring per-asset configuration.
Your component (upstream) should generate asset keys in a structure that downstream components naturally reference. This eliminates the need for meta.dagster.asset_key or complex translation configuration.
If dbt will consume your assets:
- Use keys of the form `["<source_name>", "<table_name>"]`
- Examples: `["fivetran_raw", "customers"]` or `["api_raw", "users"]`
- dbt can then reference them as `source('fivetran_raw', 'customers')`

If custom Dagster assets will consume them:
- Match the key structure those assets expect in their `deps`
- Keep keys short and consistent (e.g. `["category", "name"]`)
- Avoid deep nesting like `["system", "subsystem", "type", "name"]` unless necessary

If another integration component will consume them:
If your assets are intermediate and consumed by your own component:
- Chain a consistent structure through stages: `["raw", "table"]` → `["processed", "table"]` → `["enriched", "table"]`

```python
import dagster as dg


class APIIngestionComponent(dg.Component, dg.Model, dg.Resolvable):
    """Ingests data from REST APIs."""

    api_endpoint: str
    tables: list[str]
    demo_mode: bool = False

    def build_defs(self, context: dg.ComponentLoadContext) -> dg.Definitions:
        # Design keys for dbt consumption: ["api_raw", "table_name"]
        # NOT: ["api", "ingestion", "raw", "table_name"]
        return dg.Definitions(assets=[self._build_asset(t) for t in self.tables])

    def _build_asset(self, table: str):
        # A helper method binds `table` per call; defining the asset directly
        # inside a loop would leave every closure reading the final loop value.
        @dg.asset(
            key=dg.AssetKey(["api_raw", table]),  # ← Flattened for easy downstream reference
            kinds={"api", "python"},
        )
        def ingest_table(context: dg.AssetExecutionContext):
            if self.demo_mode:
                context.log.info(f"Demo mode: Mocking API call for {table}")
                return {"status": "demo", "rows": 100}
            else:
                # Real API call
                pass

        return ingest_table
```
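One Python pitfall when generating assets in a loop: closures are late-binding, so every function defined in the loop sees the loop variable's final value unless it is bound per iteration (via a factory function or a default argument). In plain Python:

```python
# Late binding: all three lambdas share the same `t`, read at call time.
funcs = [lambda: t for t in ["a", "b", "c"]]
print([f() for f in funcs])  # ['c', 'c', 'c']

# Binding per iteration (default argument) captures each value:
funcs = [lambda t=t: t for t in ["a", "b", "c"]]
print([f() for f in funcs])  # ['a', 'b', 'c']
```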
Result: dbt can reference these assets naturally:
```yaml
# sources.yml
sources:
  - name: api_raw
    tables:
      - name: customers  # Matches ["api_raw", "customers"]
```
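The positional correspondence between a two-part asset key and a dbt `source()` reference can be sketched in plain Python (an illustrative helper, not part of the skill or of Dagster):

```python
def dbt_source_ref(asset_key: list[str]) -> str:
    """Render the dbt source() reference matching a ["source", "table"] key."""
    if len(asset_key) != 2:
        raise ValueError(f"expected [source_name, table_name], got {asset_key}")
    source_name, table_name = asset_key
    return f"source('{source_name}', '{table_name}')"


print(dbt_source_ref(["api_raw", "customers"]))  # source('api_raw', 'customers')
```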
Always verify asset keys align with downstream dependencies:
```shell
# Check asset keys and their dependencies
uv run dg list defs --json | uv run python -c "
import sys, json
assets = json.load(sys.stdin)['assets']
print('\\n'.join([f\"{a['key']}: deps={a.get('deps', [])}\" for a in assets]))
"
```
What to verify:
- Downstream components list your assets in their `deps` array
- Keys follow a short, consistent structure (e.g. `["category", "name"]`)

❌ Too deeply nested: `["company", "team", "project", "environment", "table"]`
❌ Inconsistent structure: Some assets with 2 levels, others with 4
❌ Generic names: ["data", "table1"], ["output", "result"]
✅ Good patterns:
- `["source_system", "entity"]`: `["fivetran_raw", "customers"]`
- `["integration", "object"]`: `["salesforce", "accounts"]`
- `["stage", "table"]`: `["staging", "orders"]`

IMPORTANT: Asset keys should be exactly the same whether `demo_mode` is True or False. Only the asset implementation (the function body) should differ between modes.
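The naming rules above (consistent depth, no over-nesting, no generic names) can be checked mechanically, e.g. in CI. A sketch where the depth threshold and banned names are illustrative choices, not part of the skill:

```python
GENERIC_NAMES = {"data", "table1", "output", "result"}


def check_keys(keys: list[list[str]]) -> list[str]:
    """Flag asset keys that break the conventions: inconsistent depth,
    over-nesting, or generic names."""
    problems = []
    depths = {len(k) for k in keys}
    if len(depths) > 1:
        problems.append(f"inconsistent key depths: {sorted(depths)}")
    for key in keys:
        if len(key) > 3:  # illustrative threshold
            problems.append(f"too deeply nested: {key}")
        if key[0] in GENERIC_NAMES or key[-1] in GENERIC_NAMES:
            problems.append(f"generic name: {key}")
    return problems


print(check_keys([["fivetran_raw", "customers"], ["staging", "orders"]]))  # []
```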
Why this matters:
Example - CORRECT approach:
```python
def build_defs(self, context: dg.ComponentLoadContext) -> dg.Definitions:
    @dg.asset(
        key=dg.AssetKey(["fivetran_raw", "customers"]),  # ← Same key in both modes
        kinds={"fivetran"},
    )
    def customers_sync(context: dg.AssetExecutionContext):
        if self.demo_mode:
            # Demo implementation - mock data
            context.log.info("Demo mode: Creating empty table")
            # ... create mock table
        else:
            # Production implementation - real Fivetran sync
            context.log.info("Production: Syncing from Fivetran")
            # ... call Fivetran API

    return dg.Definitions(assets=[customers_sync])
```
Example - INCORRECT approach:
```python
def build_defs(self, context: dg.ComponentLoadContext) -> dg.Definitions:
    if self.demo_mode:
        @dg.asset(
            key=dg.AssetKey(["demo", "customers"]),  # ❌ Different key!
        )
        def demo_customers():
            pass

        return dg.Definitions(assets=[demo_customers])
    else:
        @dg.asset(
            key=dg.AssetKey(["fivetran_raw", "customers"]),  # ❌ Different key!
        )
        def prod_customers():
            pass

        return dg.Definitions(assets=[prod_customers])
```
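A lightweight guard against the incorrect pattern is to assert that the set of keys is identical in both modes. A stdlib-only sketch, with a stand-in `build_keys` in place of a real `build_defs`:

```python
def build_keys(demo_mode: bool) -> set[tuple[str, ...]]:
    """Stand-in for build_defs(): the keys produced must not depend on the mode."""
    tables = ["customers", "orders"]
    # Same keys either way; only the (omitted) implementation would differ.
    return {("fivetran_raw", t) for t in tables}


assert build_keys(demo_mode=True) == build_keys(demo_mode=False)
print("asset keys identical across modes")
```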
Reference Documentation:
Cross-reference https://docs.dagster.io/llms.txt for up-to-date titles and descriptions
Use https://docs.dagster.io/llms-full.txt for full API details
Check available integrations with:
```shell
uv run dg docs integrations --json
```
When creating assets in your component, ALWAYS add the `kinds` parameter to properly categorize assets by their technology/integration type. This helps with categorizing and filtering assets in the Dagster UI.
Common integration kinds:
- `kinds={"fivetran"}` for Fivetran assets
- `kinds={"dbt"}` for dbt assets
- `kinds={"census"}` for Census assets
- `kinds={"sling"}` for Sling assets
- `kinds={"powerbi"}` for PowerBI assets
- `kinds={"looker"}` for Looker assets
- `kinds={"airbyte"}` for Airbyte assets
- `kinds={"python"}` for custom Python processing
- `kinds={"snowflake"}` for Snowflake assets

You can verify kinds are showing correctly by running:
```shell
uv run dg list defs
```
The "Kinds" column should show the integration type for each asset.
Example Component Structure:
```python
import dagster as dg


class MyComponent(dg.Component, dg.Model, dg.Resolvable):
    demo_mode: bool = False
    # ... other params

    def build_defs(self, context: dg.ComponentLoadContext) -> dg.Definitions:
        @dg.asset(
            kinds={"fivetran"},  # ← REQUIRED: Add the integration kind
        )
        def raw_data(context: dg.AssetExecutionContext):
            if self.demo_mode:
                # Demo implementation - local/mocked data
                context.log.info("Running in demo mode with local data")
            else:
                # Real implementation - connect to actual systems
                context.log.info("Running with real data source")

        @dg.asset(
            deps=[raw_data],
            kinds={"dbt"},  # ← REQUIRED: Add the integration kind
        )
        def processed_data(context: dg.AssetExecutionContext):
            if self.demo_mode:
                context.log.info("Processing demo data")
            else:
                context.log.info("Processing real data")

        # ... more assets
        return dg.Definitions(assets=[raw_data, processed_data])
```
Use dg scaffold defs to create the component instance:
```shell
uv run dg scaffold defs my_module.components.ComponentName my_component
```
This creates a YAML file that should include the demo_mode parameter:
```yaml
type: my_module.components.ComponentName
attributes:
  demo_mode: true  # Set to true for local demos, false for real deployments
  # ... other params
```
If the user requested a custom scaffolder in Step 1, follow the directions here: https://docs.dagster.io/guides/build/components/creating-new-components/component-customization#customizing-scaffolding-behavior
Customize the scaffolder to provide a better developer experience for creating instances of this component.
Run these commands to ensure everything works:
```shell
# Check that definitions load without errors
uv run dg check defs

# List all assets to verify they were created
uv run dg list defs
```
Verify that:
- The `demo_mode` flag toggles between implementations correctly
- The `demo_mode: false` implementation uses realistic resources and is a production implementation

CRITICAL: Verify Asset Key Alignment
Check that asset dependencies are correct by running:
```shell
uv run dg list defs --json | uv run python -c "
import sys, json
data = json.load(sys.stdin)
assets = data.get('assets', [])
print('Asset Dependencies:\n')
for asset in assets:
    key = asset.get('key', 'unknown')
    deps = asset.get('deps', [])
    if deps:
        print(f'{key}')
        for dep in deps:
            print(f'  ← {dep}')
    else:
        print(f'{key} (no dependencies)')
    print()
"
```
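The same alignment check can be automated against a saved dump (`dg list defs --json > defs.json`). A stdlib sketch over a sample payload (the exact JSON shape of the CLI output is an assumption; adapt the field names to what your `dg` version emits):

```python
import json


def missing_upstreams(payload: dict) -> list[str]:
    """Return dep references that do not match any listed asset key."""
    keys = {a["key"] for a in payload.get("assets", [])}
    return [dep
            for a in payload.get("assets", [])
            for dep in a.get("deps", [])
            if dep not in keys]


sample = json.loads('{"assets": ['
                    '{"key": "api_raw/customers", "deps": []},'
                    '{"key": "staging/customers", "deps": ["api_raw/customers"]}]}')
print(missing_upstreams(sample))  # []
```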
What to verify:
- Downstream assets list their upstream assets in the `deps` array
- Keys follow a short, consistent structure (e.g. `["category", "name"]`)

Key Principle: Asset keys should be identical between demo mode and production mode. Only the asset implementation (the function body) should differ. This ensures:
If demo mode was implemented:
- Set `demo_mode: true` in the component YAML
- Run `dg check defs` to verify it works locally

The component is complete when:

- `build_defs()` is implemented with proper asset logic
- Assets include `kinds` metadata
- `dg check defs` passes without errors
- `dg list defs` shows all expected assets

After completion, inform the user:
Weekly Installs: 1
Repository: https://github.com/c00ldudenoonan/economic-data-project
GitHub Stars: 38
First Seen: Today
Security Audits: Gen Agent Trust Hub (Pass), Socket (Pass), Snyk (Pass)
Installed on: zencoder (1), amp (1), cline (1), openclaw (1), opencode (1), cursor (1)