snowflake-platform by jezweb/claude-skills
npx skills add https://github.com/jezweb/claude-skills --skill snowflake-platform
Build and deploy applications on Snowflake's AI Data Cloud using the snow CLI, Cortex AI functions, Native Apps, and Snowpark.
pip install snowflake-cli
snow --version # Should show 3.14.0+
# Interactive setup
snow connection add
# Or create ~/.snowflake/config.toml manually
[connections.default]
account = "orgname-accountname"
user = "USERNAME"
authenticator = "SNOWFLAKE_JWT"
private_key_path = "~/.snowflake/rsa_key.p8"
snow connection test -c default
snow sql -q "SELECT CURRENT_USER(), CURRENT_ACCOUNT()"
Use when: building on Snowflake with the snow CLI, Cortex AI functions, Native Apps, or Snowpark.
Don't use when: building pure Streamlit-in-Snowflake apps (use the streamlit-snowflake skill).

Snowflake Cortex provides LLM capabilities directly in SQL. Functions are in the SNOWFLAKE.CORTEX schema.
| Function | Purpose | GA Status |
|---|---|---|
| COMPLETE / AI_COMPLETE | Text generation from prompt | GA Nov 2025 |
| SUMMARIZE / AI_SUMMARIZE | Summarize text | GA |
| TRANSLATE / AI_TRANSLATE | Translate between languages | GA Sep 2025 |
| SENTIMENT / AI_SENTIMENT | Sentiment analysis | GA Jul 2025 |
| AI_FILTER | Natural language filtering | GA Nov 2025 |
| AI_CLASSIFY | Categorize text/images | GA Nov 2025 |
| AI_AGG | Aggregate insights across rows | GA Nov 2025 |
-- Simple prompt
SELECT SNOWFLAKE.CORTEX.COMPLETE(
'llama3.1-70b',
'Explain quantum computing in one sentence'
) AS response;
-- With conversation history
SELECT SNOWFLAKE.CORTEX.COMPLETE(
'llama3.1-70b',
[
{'role': 'system', 'content': 'You are a helpful assistant'},
{'role': 'user', 'content': 'What is Snowflake?'}
]
) AS response;
-- With options
SELECT SNOWFLAKE.CORTEX.COMPLETE(
'mistral-large2',
'Summarize this document',
{'temperature': 0.3, 'max_tokens': 500}
) AS response;
Available Models:

- llama3.1-70b, llama3.1-8b, llama3.2-3b
- mistral-large2, mistral-7b
- snowflake-arctic
- gemma-7b
- claude-3-5-sonnet (200K context)

Model Context Windows (Updated 2025):
| Model | Context Window | Best For |
|---|---|---|
| Claude 3.5 Sonnet | 200,000 tokens | Large documents, long conversations |
| Llama3.1-70b | 128,000 tokens | Complex reasoning, medium documents |
| Llama3.1-8b | 8,000 tokens | Simple tasks, short text |
| Llama3.2-3b | 8,000 tokens | Fast inference, minimal text |
| Mistral-large2 | Variable | Check current docs |
| Snowflake Arctic | Variable | Check current docs |
Token Math : ~4 characters = 1 token. A 32,000 character document ≈ 8,000 tokens.
Error : Input exceeds context window limit → Use smaller model or chunk your input.
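The 4-characters-per-token rule above can be turned into a quick pre-flight check before calling COMPLETE. A minimal sketch, assuming the window sizes from the table above; `estimate_tokens` is the rough heuristic, not the model's real tokenizer:

```python
# Rough token estimate: ~4 characters per token (heuristic, not a real tokenizer)
def estimate_tokens(text: str) -> int:
    return len(text) // 4

# Context windows from the table above (tokens)
CONTEXT_WINDOWS = {
    "claude-3-5-sonnet": 200_000,
    "llama3.1-70b": 128_000,
    "llama3.1-8b": 8_000,
    "llama3.2-3b": 8_000,
}

def fits_context(text: str, model: str, reserve_for_output: int = 500) -> bool:
    """Check whether a prompt is likely to fit a model's context window."""
    return estimate_tokens(text) + reserve_for_output <= CONTEXT_WINDOWS[model]

def chunk_text(text: str, max_tokens: int) -> list[str]:
    """Split text into chunks that each fit under max_tokens (by the heuristic)."""
    max_chars = max_tokens * 4
    return [text[i:i + max_chars] for i in range(0, len(text), max_chars)]

doc = "x" * 32_000                        # 32,000-character document
print(estimate_tokens(doc))               # 8000 tokens
print(fits_context(doc, "llama3.1-8b"))   # False - needs a bigger model or chunking
print(len(chunk_text(doc, 4_000)))        # 2 chunks for an 8K-window model
```

Chunking and summarizing per chunk (then summarizing the summaries) is a common workaround when a document exceeds every available window.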
-- Single text
SELECT SNOWFLAKE.CORTEX.SUMMARIZE(article_text) AS summary
FROM articles
LIMIT 10;
-- Aggregate across rows (no context window limit)
SELECT AI_SUMMARIZE_AGG(review_text) AS all_reviews_summary
FROM product_reviews
WHERE product_id = 123;
-- Translate to English (auto-detect source)
SELECT SNOWFLAKE.CORTEX.TRANSLATE(
review_text,
'', -- Empty = auto-detect source language
'en' -- Target language
) AS translated
FROM international_reviews;
-- Explicit source language
SELECT AI_TRANSLATE(
description,
'es', -- Source: Spanish
'en' -- Target: English
) AS translated
FROM spanish_products;
Performance : As of September 2025, AI_FILTER includes automatic optimization delivering 2-10x speedup and up to 60% token reduction for suitable queries.
-- Filter with plain English
SELECT * FROM customer_feedback
WHERE AI_FILTER(
feedback_text,
'mentions shipping problems or delivery delays'
);
-- Combine with SQL predicates for maximum optimization
-- Query planner applies standard filters FIRST, then AI on smaller dataset
SELECT * FROM support_tickets
WHERE created_date > '2025-01-01' -- Standard filter applied first
AND AI_FILTER(description, 'customer is angry or frustrated');
Best Practice : Always combine AI_FILTER with traditional SQL predicates (date ranges, categories, etc.) to reduce the dataset before AI processing. This maximizes the automatic optimization benefits.
Throttling : During peak usage, AI function requests may be throttled with retry-able errors. Implement exponential backoff for production applications (see Known Issue #10).
-- Categorize support tickets
SELECT
ticket_id,
AI_CLASSIFY(
description,
['billing', 'technical', 'shipping', 'other']
) AS category
FROM support_tickets;
Cortex AI functions bill based on tokens processed (input + output).
Cost Management at Scale (Community-sourced):
A real-world production case study showed that a single AI_COMPLETE query processing 1.18 billion records cost nearly $5K in credits. Cost drivers to watch:
-- This seemingly simple query can be expensive at scale
SELECT
product_id,
AI_COMPLETE('mistral-large2', 'Summarize: ' || review_text) as summary
FROM product_reviews -- 1 billion rows
WHERE created_date > '2024-01-01';
-- Cost = (input tokens + output tokens) × row count × model rate
-- At scale, this adds up fast
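The cost formula in the comment above can be turned into a quick estimator before running a query at scale. A hedged sketch - the credit rate and dollar price below are placeholders, not real rates (check Snowflake's Cortex credit-consumption table and your contract):

```python
def estimate_cortex_cost(
    rows: int,
    avg_input_tokens: int,
    avg_output_tokens: int,
    credits_per_million_tokens: float,  # placeholder - varies by model
    dollars_per_credit: float,          # placeholder - varies by contract
) -> float:
    """(input + output tokens) x row count x model rate, converted to dollars."""
    total_tokens = (avg_input_tokens + avg_output_tokens) * rows
    credits = (total_tokens / 1_000_000) * credits_per_million_tokens
    return credits * dollars_per_credit

# 1 billion rows at ~250 tokens each adds up fast, even at a low hypothetical rate:
print(estimate_cortex_cost(1_000_000_000, 200, 50, 1.0, 3.0))  # 750000.0
```

Running the estimator (or the real query with a `LIMIT`-ed sample) before the full-table run is the cheapest way to catch a five-figure surprise.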
Best Practices :
Source : The Hidden Cost of Snowflake Cortex AI (Community blog with billing evidence)
Critical : Snowflake uses TWO account identifier formats:
| Format | Example | Used For |
|---|---|---|
| Organization-Account | irjoewf-wq46213 | REST API URLs, connection config |
| Account Locator | NZ90655 | JWT claims (iss, sub) |
These are NOT interchangeable!
SELECT CURRENT_ACCOUNT(); -- Returns: NZ90655
# Generate private key (PKCS#8 format required)
openssl genrsa 2048 | openssl pkcs8 -topk8 -inform PEM -out ~/.snowflake/rsa_key.p8 -nocrypt
# Generate public key
openssl rsa -in ~/.snowflake/rsa_key.p8 -pubout -out ~/.snowflake/rsa_key.pub
# Get fingerprint for JWT claims
openssl rsa -in ~/.snowflake/rsa_key.p8 -pubout -outform DER | \
openssl dgst -sha256 -binary | openssl enc -base64
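The fingerprint the openssl pipeline produces can also be computed in Python from the DER-encoded public key bytes (e.g. the output of the `openssl ... -outform DER` command above written to a file). A sketch using only the standard library; the `rsa_key.der` filename is an assumption:

```python
import base64
import hashlib

def public_key_fingerprint(der_bytes: bytes) -> str:
    """Base64-encoded SHA-256 of the DER public key - the value after 'SHA256:'."""
    return base64.b64encode(hashlib.sha256(der_bytes).digest()).decode("ascii")

# Usage sketch with the DER file produced by the openssl command above:
# with open("rsa_key.der", "rb") as f:
#     print("SHA256:" + public_key_fingerprint(f.read()))
```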
-- In Snowflake worksheet (requires ACCOUNTADMIN or SECURITYADMIN)
ALTER USER my_user SET RSA_PUBLIC_KEY='MIIBIjANBgkq...';
iss: ACCOUNT_LOCATOR.USERNAME.SHA256:fingerprint
sub: ACCOUNT_LOCATOR.USERNAME
Example:
iss: NZ90655.JEZWEB.SHA256:jpZO6LvU2SpKd8tE61OGfas5ZXpfHloiJd7XHLPDEEA=
sub: NZ90655.JEZWEB
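Assembling those claims programmatically avoids format mistakes. A minimal sketch of the claim payload only - signing it with the private key (e.g. RS256 via a JWT library) is omitted, and the locator/user values are the examples above:

```python
from datetime import datetime, timedelta, timezone

def snowflake_jwt_claims(account_locator: str, user: str, fingerprint: str) -> dict:
    """Build the iss/sub claims Snowflake key-pair auth expects.

    account_locator: from SELECT CURRENT_ACCOUNT() - NOT the org-account format.
    fingerprint: base64 SHA-256 of the DER public key (see openssl commands above).
    """
    qualified_user = f"{account_locator.upper()}.{user.upper()}"
    now = datetime.now(timezone.utc)
    return {
        "iss": f"{qualified_user}.SHA256:{fingerprint}",
        "sub": qualified_user,
        "iat": int(now.timestamp()),
        "exp": int((now + timedelta(minutes=59)).timestamp()),  # lifetime must be <= 1 hour
    }

claims = snowflake_jwt_claims("NZ90655", "JEZWEB", "jpZO6LvU2SpKd8tE61OGfas5ZXpfHloiJd7XHLPDEEA=")
print(claims["sub"])  # NZ90655.JEZWEB
```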
New in January 2026 : Connector automatically detects and uses SPCS service identifier tokens when running inside Snowpark Container Services.
# No special configuration needed inside SPCS containers
import snowflake.connector
# Auto-detects SPCS_TOKEN environment variable
conn = snowflake.connector.connect()
This enables seamless authentication from containerized Snowpark services without explicit credentials.
Source : Release v4.2.0
# Initialize project
snow init
# Execute SQL
snow sql -q "SELECT 1"
snow sql -f query.sql
# View logs
snow logs
# Development
snow app run # Deploy and run locally
snow app deploy # Upload to stage only
snow app teardown # Remove app
# Versioning
snow app version create V1_0
snow app version list
snow app version drop V1_0
# Publishing
snow app publish --version V1_0 --patch 0
# Release Channels
snow app release-channel list
snow app release-channel add-version --channel ALPHA --version V1_0
snow app release-directive set default --version V1_0 --patch 0 --channel DEFAULT
snow streamlit deploy --replace
snow streamlit deploy --replace --open
snow stage list
snow stage copy @my_stage/file.txt ./local/
my_native_app/
├── snowflake.yml # Project config
├── manifest.yml # App manifest
├── setup_script.sql # Installation script
├── app/
│ └── streamlit/
│ ├── environment.yml
│ └── streamlit_app.py
└── scripts/
└── setup.sql
definition_version: 2
native_app:
name: my_app
package:
name: my_app_pkg
distribution: external # For marketplace
application:
name: my_app
source_stage: stage/dev
artifacts:
- src: manifest.yml
dest: manifest.yml
- src: setup_script.sql
dest: setup_script.sql
- src: app/streamlit/environment.yml
dest: streamlit/environment.yml
- src: app/streamlit/streamlit_app.py
dest: streamlit/streamlit_app.py
enable_release_channels: true # For ALPHA/BETA channels
manifest_version: 1
artifacts:
setup_script: setup_script.sql
default_streamlit: streamlit/streamlit_app.py
# Note: Do NOT include privileges section - Native Apps can't declare privileges
Native Apps calling external APIs need this setup:
-- 1. Create network rule (in a real database, NOT app package)
CREATE DATABASE IF NOT EXISTS MY_APP_UTILS;
CREATE OR REPLACE NETWORK RULE MY_APP_UTILS.PUBLIC.api_rule
MODE = EGRESS
TYPE = HOST_PORT
VALUE_LIST = ('api.example.com:443');
-- 2. Create integration
CREATE OR REPLACE EXTERNAL ACCESS INTEGRATION my_app_integration
ALLOWED_NETWORK_RULES = (MY_APP_UTILS.PUBLIC.api_rule)
ENABLED = TRUE;
-- 3. Grant to app
GRANT USAGE ON INTEGRATION my_app_integration
TO APPLICATION MY_APP;
-- 4. CRITICAL: Attach to Streamlit (must repeat after EVERY deploy!)
ALTER STREAMLIT MY_APP.config_schema.my_streamlit
SET EXTERNAL_ACCESS_INTEGRATIONS = (my_app_integration);
Warning : Step 4 resets on every snow app run. Must re-run after each deploy!
When your Native App needs data from an external database:
-- 1. Create shared_data schema in app package
CREATE SCHEMA IF NOT EXISTS MY_APP_PKG.SHARED_DATA;
-- 2. Create views referencing external database
CREATE OR REPLACE VIEW MY_APP_PKG.SHARED_DATA.MY_VIEW AS
SELECT * FROM EXTERNAL_DB.SCHEMA.TABLE;
-- 3. Grant REFERENCE_USAGE (CRITICAL!)
GRANT REFERENCE_USAGE ON DATABASE EXTERNAL_DB
TO SHARE IN APPLICATION PACKAGE MY_APP_PKG;
-- 4. Grant access to share
GRANT USAGE ON SCHEMA MY_APP_PKG.SHARED_DATA
TO SHARE IN APPLICATION PACKAGE MY_APP_PKG;
GRANT SELECT ON ALL VIEWS IN SCHEMA MY_APP_PKG.SHARED_DATA
TO SHARE IN APPLICATION PACKAGE MY_APP_PKG;
In setup_script.sql, reference shared_data.view_name (NOT the original database).
# 1. Deploy app
snow app run
# 2. Create version
snow app version create V1_0
# 3. Check security review status
snow app version list
# Wait for review_status = APPROVED
# 4. Set release directive
snow app release-directive set default --version V1_0 --patch 0 --channel DEFAULT
# 5. Create listing in Snowsight Provider Studio (UI only)
| Status | Meaning | Action |
|---|---|---|
| NOT_REVIEWED | Scan hasn't run | Check DISTRIBUTION is EXTERNAL |
| IN_PROGRESS | Scan running | Wait |
| APPROVED | Passed | Can publish |
| REJECTED | Failed | Fix issues or appeal |
| MANUAL_REVIEW | Human reviewing | Wait (can take days) |
Triggers manual review : External access integrations, Streamlit components, network calls.
| Field | Max Length | Notes |
|---|---|---|
| Title | 72 chars | App name |
| Subtitle | 128 chars | One-liner |
| Description | 10,000 chars | HTML editor |
| Business Needs | 6 max | Select from dropdown |
| Quick Start Examples | 10 max | Title + Description + SQL |
| Data Dictionary | Required | Mandatory for data listings (2025) |
| # | Requirement |
|---|---|
| 1 | Full Snowflake account (not trial) |
| 2 | ACCOUNTADMIN role |
| 3 | Provider Profile approved |
| 4 | Stripe account configured |
| 5 | Provider & Consumer Terms accepted |
| 6 | Contact Marketplace Ops |
Note : Cannot convert free listing to paid. Must create new listing.
from snowflake.snowpark import Session
connection_params = {
"account": "orgname-accountname",
"user": "USERNAME",
"password": "PASSWORD", # Or use private_key_path
"warehouse": "COMPUTE_WH",
"database": "MY_DB",
"schema": "PUBLIC"
}
session = Session.builder.configs(connection_params).create()
# Read table
df = session.table("MY_TABLE")
# Filter and select
result = df.filter(df["STATUS"] == "ACTIVE") \
.select("ID", "NAME", "CREATED_AT") \
.sort("CREATED_AT", ascending=False)
# Execute
result.show()
# Collect to Python
rows = result.collect()
# WRONG - dict() doesn't work on Snowpark Row
config = dict(result[0])
# CORRECT - Access columns explicitly
row = result[0]
config = {
'COLUMN_A': row['COLUMN_A'],
'COLUMN_B': row['COLUMN_B'],
}
New in January 2026 : SnowflakeCursor.stats property exposes granular DML statistics for operations where rowcount is insufficient (e.g., CTAS queries).
# Before v4.2.0 - rowcount returns -1 for CTAS
cursor.execute("CREATE TABLE new_table AS SELECT * FROM source WHERE active = true")
print(cursor.rowcount) # Returns -1 (not helpful!)
# After v4.2.0 - stats property shows actual row counts
cursor.execute("CREATE TABLE new_table AS SELECT * FROM source WHERE active = true")
print(cursor.stats) # Returns {'rows_inserted': 1234, 'duplicates': 0, ...}
Source : Release v4.2.0
from snowflake.snowpark.functions import udf, sproc
# Register UDF
@udf(name="my_udf", replace=True)
def my_udf(x: int) -> int:
return x * 2
# Register Stored Procedure
@sproc(name="my_sproc", replace=True)
def my_sproc(session: Session, table_name: str) -> str:
df = session.table(table_name)
count = df.count()
return f"Row count: {count}"
The REST API is the foundation for programmatic Snowflake access from Cloudflare Workers.
https://{org-account}.snowflakecomputing.com/api/v2/statements
ALL requests must include these headers - missing Accept causes silent failures:
const headers = {
'Authorization': `Bearer ${jwt}`,
'Content-Type': 'application/json',
'Accept': 'application/json', // REQUIRED - "null" error if missing
'User-Agent': 'MyApp/1.0',
};
Even simple queries return async (HTTP 202). Always implement polling:
// Submit returns statementHandle, not results
const submit = await fetch(url, { method: 'POST', headers, body });
const { statementHandle } = await submit.json();
// Poll until complete - bound the loop so a non-200/202 response can't spin forever
for (let attempt = 0; attempt < 45; attempt++) {
  const res = await fetch(`${url}/${statementHandle}`, { headers });
  if (res.status === 200) break;        // Complete - body has the results
  if (res.status !== 202) {
    throw new Error(`Query failed with status ${res.status}`);
  }
  await sleep(2000);                    // 202 = still running
}
| Plan | Limit | Safe Polling |
|---|---|---|
| Free | 50 | 45 attempts @ 2s = 90s max |
| Paid | 1,000 | 100 attempts @ 500ms = 50s max |
Workers fetch() has no default timeout. Always pass an abort signal such as AbortSignal.timeout:
const response = await fetch(url, {
signal: AbortSignal.timeout(30000), // 30 seconds
headers,
});
Cancel queries when timeout occurs to avoid warehouse costs:
POST /api/v2/statements/{statementHandle}/cancel
See templates/snowflake-rest-client.ts for complete implementation.
Symptom : JWT auth fails silently, queries don't appear in Query History.
Cause : Using org-account format in JWT claims instead of account locator.
Fix : Use SELECT CURRENT_ACCOUNT() to get the actual account locator.
Symptom : API calls fail after snow app run.
Cause : External access integration attachment resets on every deploy.
Fix : Re-run ALTER STREAMLIT ... SET EXTERNAL_ACCESS_INTEGRATIONS after each deploy.
Symptom : ALTER APPLICATION PACKAGE ... SET DEFAULT RELEASE DIRECTIVE fails.
Cause : Legacy SQL syntax doesn't work with release channels enabled.
Fix : Use snow CLI: snow app release-directive set default --version V1_0 --patch 0 --channel DEFAULT
Symptom : Files appear in streamlit/streamlit/ instead of streamlit/.
Cause : Directory mappings in snowflake.yml nest the folder name.
Fix : List individual files explicitly in artifacts, not directories.
Symptom : "A view that is added to the shared content cannot reference objects from other databases"
Cause : Missing GRANT REFERENCE_USAGE ON DATABASE for shared data.
Fix : Always grant REFERENCE_USAGE before snow app run when using external databases.
Symptom : "Unsupported Accept header null is specified" on polling requests.
Cause : Initial request had Accept: application/json but polling request didn't.
Fix : Use consistent headers helper function for ALL requests (submit, poll, cancel).
Symptom : Worker hangs indefinitely waiting for Snowflake response.
Cause : Cloudflare Workers' fetch() has no default timeout.
Fix : Always use AbortSignal.timeout(30000) on all Snowflake requests.
Symptom : "Too many subrequests" error during polling.
Cause : Polling every 1 second × 600 attempts = 600 subrequests exceeds limits.
Fix : Poll every 2-5 seconds, limit to 45 (free) or 100 (paid) attempts.
Symptom : Queries return statementHandle but never complete (code 090001 indefinitely).
Cause : Code 090001 means "still running", not an error. The warehouse IS resuming; it just takes time.
Fix : Auto-resume works. Wait longer or explicitly resume first: POST /api/v2/warehouses/{wh}:resume
Error : Long-running Python applications show memory growth over time
Source : GitHub Issues #2727, #2725
Affects : snowflake-connector-python 4.0.0 - 4.2.0
Why It Happens :

- SessionManager uses a defaultdict, which prevents garbage collection
- SnowflakeRestful.fetch() holds references that leak during query execution

Prevention : Reuse connections rather than creating new ones repeatedly. A fix is in progress via PR #2741 and PR #2726.
# AVOID - creates new connection each iteration
for i in range(1000):
conn = snowflake.connector.connect(...)
cursor = conn.cursor()
cursor.execute("SELECT 1")
cursor.close()
conn.close()
# BETTER - reuse connection
conn = snowflake.connector.connect(...)
cursor = conn.cursor()
for i in range(1000):
cursor.execute("SELECT 1")
cursor.close()
conn.close()
Status : Fix expected in connector v4.3.0 or later
Error : "Request throttled due to high usage. Please retry."
Source : Snowflake Cortex Documentation
Affects : All Cortex AI functions (COMPLETE, FILTER, CLASSIFY, etc.)
Why It Happens : AI/LLM requests may be throttled during high usage periods to manage platform capacity. Throttled requests return errors and require manual retries.
Prevention : Implement retry logic with exponential backoff:
import time
import snowflake.connector
def execute_with_retry(cursor, query, max_retries=3):
for attempt in range(max_retries):
try:
return cursor.execute(query).fetchall()
except snowflake.connector.errors.DatabaseError as e:
if "throttled" in str(e).lower() and attempt < max_retries - 1:
wait_time = 2 ** attempt # Exponential backoff
time.sleep(wait_time)
else:
raise
Status : Documented behavior, no fix planned