Claude Agent SDK 专家指南：构建自主AI代理，实现计算机控制与MCP集成

Claude SDK Expert by frankxai/claude-skills-library

5 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/frankxai/claude-skills-library --skill 'Claude SDK Expert'

AI/机器学习开发自动化

🇨🇳中文介绍

Claude SDK 专家技能

目的

本技能提供关于使用 Claude Agent SDK（原 Claude Code SDK）构建自主 AI 代理的全面指导，利用计算机使用能力、工具编排和 MCP 集成进行生产部署。

SDK 概述

Claude Agent SDK (2025)

Claude Agent SDK 支持构建能够与计算机交互、写入文件、运行命令并迭代其工作的自主代理。

演变： 从“Claude Code SDK”更名，以反映其超越编码的更广泛能力。

核心理念： 为 Claude 提供一台计算机，以解锁超越基于聊天的交互的代理效能。

核心能力

1. 计算机使用

革命性功能： Claude 可以控制计算机环境以完成任务。

这实现了：

文件系统操作（读取、写入、编辑）
终端命令执行
迭代调试和优化
多步骤自主工作流
现实世界任务完成

使用案例：

分析投资组合的金融代理
预订旅行的个人助理
处理复杂请求的客户支持
构建软件的开发代理
收集和分析数据的研究代理

2. 内置工具

文件操作：

Read - 读取文件内容
Write - 创建或覆盖文件

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

模式 1：自主任务完成

场景： 代理无需人工干预即可完成多步骤任务

User Request
    ↓
Claude analyzes task
    ↓
Breaks into subtasks
    ↓
Executes via tools (Read, Bash, Write, etc.)
    ↓
Iterates on failures
    ↓
Returns result

from anthropic import Anthropic

client = Anthropic()

response = client.messages.create(
    model="claude-sonnet-4-5",
    max_tokens=4096,
    tools=[
        {"type": "computer_use"},
        {"type": "bash"},
        {"type": "file_operations"}
    ],
    messages=[{
        "role": "user",
        "content": "Analyze the last 30 days of sales data and create a summary report"
    }]
)

# Claude autonomously:
# 1. Reads sales data files
# 2. Runs analysis scripts
# 3. Generates report
# 4. Saves to file

模式 2：人工介入审批

场景： 代理提出行动方案，在执行前等待审批

Task → Plan → Show to Human → Approve? → Execute → Result
                                ↓ No
                            Revise Plan

# Step 1: Generate plan
plan_response = client.messages.create(
    model="claude-sonnet-4-5",
    messages=[{
        "role": "user",
        "content": "Create a plan to refactor the authentication system"
    }]
)

# Step 2: Human reviews plan
if human_approves(plan_response.content):
    # Step 3: Execute with tools
    execution_response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=all_tools,
        messages=[{
            "role": "user",
            "content": f"Execute this plan: {plan_response.content}"
        }]
    )

模式 3：迭代优化

场景： 代理根据反馈/错误迭代工作

Attempt 1 → Error → Analyze → Attempt 2 → Error → Analyze → Attempt 3 → Success

内置功能： Claude SDK 通过计算机使用自然支持此模式——代理可以查看命令输出并进行调整。

工具设计最佳实践

自定义工具创建

良好的工具设计：

# Clear, focused tool
{
    "name": "get_customer_orders",
    "description": "Retrieve all orders for a specific customer ID",
    "input_schema": {
        "type": "object",
        "properties": {
            "customer_id": {
                "type": "string",
                "description": "The unique customer identifier"
            },
            "since_date": {
                "type": "string",
                "description": "ISO date to filter orders from (optional)"
            }
        },
        "required": ["customer_id"]
    }
}

不良的工具设计：

# Too broad, unclear purpose
{
    "name": "do_customer_stuff",
    "description": "Does various things with customers",
    "input_schema": {
        "type": "object",
        "properties": {
            "action": {"type": "string"},
            "data": {"type": "object"}
        }
    }
}

应该： ✅ 提供与任务相关的工具 ✅ 使用清晰、描述性的名称 ✅ 编写详细的描述（Claude 会阅读这些！） ✅ 定义严格的输入模式 ✅ 在工具中实现错误处理 ✅ 返回结构化的、可解析的输出

不应该： ❌ 给代理不需要的工具（会增加混淆） ❌ 使用模糊的名称，如“handler”或“processor” ❌ 跳过输入验证 ❌ 返回没有上下文的原始错误消息 ❌ 使工具的副作用不明确

连接 MCP 服务器

# Define MCP server connection
mcp_config = {
    "servers": {
        "github": {
            "command": "npx",
            "args": ["-y", "@modelcontextprotocol/server-github"],
            "env": {
                "GITHUB_TOKEN": os.getenv("GITHUB_TOKEN")
            }
        },
        "postgres": {
            "command": "docker",
            "args": ["run", "mcp-postgres-server"],
            "env": {
                "DATABASE_URL": os.getenv("DATABASE_URL")
            }
        }
    }
}

# Claude automatically discovers tools from MCP servers
response = client.messages.create(
    model="claude-sonnet-4-5",
    mcp_servers=mcp_config,
    messages=[{
        "role": "user",
        "content": "Find all GitHub issues assigned to me and update the project database"
    }]
)
# Claude uses both github and postgres MCP tools

自定义 MCP 服务器

# Create custom MCP server for internal API
from mcp import Server, Tool

server = Server("internal-crm")

@server.tool()
def get_customer_data(customer_id: str):
    """Retrieve customer information from internal CRM"""
    return crm_api.get_customer(customer_id)

@server.tool()
def update_customer_notes(customer_id: str, notes: str):
    """Add notes to customer record"""
    return crm_api.update(customer_id, {"notes": notes})

# Deploy and connect to Claude

1. 流式传输以提升用户体验

原因： 实时显示用户进度，建立对代理操作的信任

with client.messages.stream(
    model="claude-sonnet-4-5",
    max_tokens=4096,
    tools=tools,
    messages=messages
) as stream:
    for event in stream:
        if event.type == "content_block_delta":
            print(event.delta.text, end="", flush=True)
        elif event.type == "tool_use":
            print(f"\nUsing tool: {event.name}")

稳健的错误管理：

try:
    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=tools,
        messages=messages
    )
except anthropic.APIError as e:
    # Handle API errors
    log_error(f"API Error: {e}")
    return fallback_response()
except anthropic.RateLimitError:
    # Handle rate limits
    time.sleep(60)
    retry()
except Exception as e:
    # Handle tool execution errors
    log_error(f"Tool Error: {e}")
    return safe_error_message()

简单任务使用 Claude Haiku，复杂推理使用 Sonnet
为重复上下文实现缓存
尽可能批量处理类似请求
适当限制 max_tokens
通过回调监控令牌使用情况

Use appropriate model for task

simple_task_response = client.messages.create( model="claude-haiku-4", # Cheaper, faster messages=[{"role": "user", "content": "Format this JSON"}] )

complex_task_response = client.messages.create( model="claude-sonnet-4-5", # More capable messages=[{"role": "user", "content": "Analyze architectural trade-offs"}] )

关键安全措施：

# Restrict file access
safe_file_tools = {
    "read": {
        "allowed_paths": ["/data/public"],
        "denied_paths": ["/etc", "/secrets"]
    },
    "write": {
        "allowed_paths": ["/output"],
        "denied_paths": ["/"]
    }
}

def sanitize_bash_command(cmd: str) -> str:
    """Prevent dangerous commands"""
    dangerous = ["rm -rf", ":(){ :|:& };:", "dd if="]
    for danger in dangerous:
        if danger in cmd:
            raise SecurityError(f"Dangerous command blocked: {danger}")
    return cmd

审计日志记录：

def log_agent_action(action: dict):
    """Track all agent actions for security audit"""
    audit_log.write({
        "timestamp": datetime.now(),
        "tool": action["tool_name"],
        "input": action["input"],
        "user": action["user_id"],
        "result": action["result"]
    })

在适当时，Claude 可以同时使用多个工具：

# Claude automatically parallelizes when possible
response = client.messages.create(
    model="claude-sonnet-4-5",
    tools=[weather_api, stock_api, news_api],
    messages=[{
        "role": "user",
        "content": "Give me weather, stock prices, and news for San Francisco"
    }]
)
# Claude calls all 3 APIs in parallel

# Cache system prompts and large contexts
response = client.messages.create(
    model="claude-sonnet-4-5",
    system=[{
        "type": "text",
        "text": large_system_prompt,
        "cache_control": {"type": "ephemeral"}
    }],
    messages=messages
)
# System prompt cached for ~5 minutes

def test_customer_lookup_tool():
    """Test individual tool behavior"""
    result = get_customer_orders("CUST123")
    assert result["customer_id"] == "CUST123"
    assert isinstance(result["orders"], list)

def test_agent_workflow():
    """Test agent using multiple tools"""
    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=[tool1, tool2, tool3],
        messages=[{
            "role": "user",
            "content": "Process order #12345"
        }]
    )

    # Verify expected tool usage
    tool_calls = extract_tool_calls(response)
    assert "verify_order" in tool_calls
    assert "process_payment" in tool_calls

# Use Claude's built-in evaluation
from anthropic import Anthropic

eval_client = Anthropic()

eval_results = eval_client.evaluate(
    agent=my_agent,
    test_cases=[
        {"input": "...", "expected_output": "..."},
        # More test cases
    ],
    metrics=["accuracy", "latency", "tool_efficiency"]
)

模式：多步骤研究

async def research_agent(query: str):
    """Agent researches topic using multiple sources"""
    response = await client.messages.create(
        model="claude-sonnet-4-5",
        tools=[web_search, web_fetch, summarize],
        messages=[{
            "role": "user",
            "content": f"Research '{query}' and provide comprehensive summary"
        }]
    )
    # Claude: searches → fetches articles → summarizes → synthesizes
    return response.content

模式：代码生成与测试

def code_agent(requirements: str):
    """Agent writes and tests code"""
    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=[write_file, bash, read_file],
        messages=[{
            "role": "user",
            "content": f"Write and test code for: {requirements}"
        }]
    )
    # Claude: writes code → saves file → runs tests → fixes errors → retries
    return response.content

模式：数据管道

def data_pipeline_agent(source: str, destination: str):
    """Agent ETL pipeline"""
    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=[read_file, bash, postgres_insert],
        messages=[{
            "role": "user",
            "content": f"Extract data from {source}, transform it, and load to {destination}"
        }]
    )
    # Claude orchestrates full ETL
    return response.content

Claude Sonnet 4.5 (claude-sonnet-4-5)

复杂推理和分析
多步骤自主任务
代码生成和调试
研究和综合
高风险决策

最高能力
最适合计算机使用
更昂贵
比 Haiku 慢

Claude Haiku 4 (claude-haiku-4)

简单、定义明确的任务
格式转换
快速分类
高吞吐量场景
成本敏感型应用

快速响应
成本较低
适合结构化任务
复杂推理能力有限

from fastapi import FastAPI
from anthropic import Anthropic

app = FastAPI()
client = Anthropic()

@app.post("/agent/task")
async def run_agent_task(task: dict):
    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=load_tools_for_task(task),
        messages=[{
            "role": "user",
            "content": task["description"]
        }]
    )
    return {"result": response.content}

与 LangChain 集成（通过 LangChain-Anthropic）

from langchain_anthropic import ChatAnthropic
from langchain.agents import initialize_agent

llm = ChatAnthropic(model="claude-sonnet-4-5")
agent = initialize_agent(
    tools=[tool1, tool2],
    llm=llm,
    agent_type="structured-chat-zero-shot-react-description"
)
result = agent.run("Complete this task")

监控与可观测性

工具调用成功率 - 工具调用成功的百分比
任务完成率 - 用户请求完全解决的百分比
平均迭代次数 - 每个任务的工具调用次数
延迟 - 完成请求的时间
令牌使用量 - 每个请求的输入+输出令牌数
错误率 - 出现错误的请求百分比

日志记录最佳实践

import logging

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("claude-agent")

def run_agent_with_logging(task):
    logger.info(f"Starting task: {task}")

    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=tools,
        messages=[{"role": "user", "content": task}]
    )

    logger.info(f"Tools used: {extract_tools(response)}")
    logger.info(f"Token usage: {response.usage}")

    return response

在以下情况下使用 Claude SDK：

基于 Anthropic 模型（Claude 系列）构建
需要计算机使用能力（文件、bash、迭代）
想要生产就绪的代理框架
需要 MCP 集成以连接数据源
构建自主任务完成代理

在以下情况下考虑替代方案：

已投入 OpenAI 生态系统（使用 AgentKit）
需要可视化代理构建器（使用 AgentKit）
需要复杂的状态机（使用 LangGraph）
想要完全的开源控制（使用 AutoGen/LangGraph）

计算机使用是游戏规则的改变者 - 充分利用文件/bash 能力
工具是一等公民 - 像设计提示词一样精心设计工具
MCP 用于数据 - 使用 MCP 服务器实现企业数据连接
流式传输提升用户体验 - 实时反馈建立用户信任
安全至上 - 验证输入、限制权限、审计操作
为任务选择合适的模型 - 简单任务用 Haiku，复杂任务用 Sonnet

本技能确保您使用 Claude 在 2025 年的尖端能力构建强大、自主的代理。

🇺🇸English

Claude SDK Expert Skill

Purpose

This skill provides comprehensive guidance on building autonomous AI agents using the Claude Agent SDK (formerly Claude Code SDK), leveraging computer use capabilities, tool orchestration, and MCP integration for production deployments.

SDK Overview

Claude Agent SDK (2025)

The Claude Agent SDK enables building autonomous agents that can interact with computers, write files, run commands, and iterate on their work.

Evolution: Renamed from "Claude Code SDK" to reflect broader capabilities beyond coding.

Core Philosophy: Give Claude a computer to unlock agent effectiveness beyond chat-based interactions.

Key Capabilities

1. Computer Use

Revolutionary Feature: Claude can control a computer environment to complete tasks.

What This Enables:

File system operations (read, write, edit)
Terminal command execution
Iterative debugging and refinement
Multi-step autonomous workflows
Real-world task completion

Use Cases:

Finance agents analyzing portfolios
Personal assistants booking travel
Customer support handling complex requests
Development agents building software
Research agents gathering and analyzing data

2. Built-in Tools

File Operations:

Read - Read file contents
Write - Create or overwrite files
Edit - Make targeted edits to existing files

Command Execution:

Bash - Run shell commands and scripts

Search & Discovery:

Grep - Search file contents with regex
Glob - Find files by pattern

Web Access:

WebFetch - Retrieve and analyze web pages
WebSearch - Search the internet for information

All tools are production-tested and optimized for agent use.

3. MCP Integration

Model Context Protocol Support: Define custom tools via MCP servers.

Benefits:

Standardized tool interface
Reusable across different agents
Community ecosystem of MCP servers
Enterprise data source connectivity

Example MCP Servers:

GitHub, Slack, Google Drive
PostgreSQL, MongoDB
Stripe, Salesforce
Custom internal APIs

Architecture Patterns

Pattern 1: Autonomous Task Completion

Scenario: Agent completes multi-step task without human intervention

Flow:

User Request
    ↓
Claude analyzes task
    ↓
Breaks into subtasks
    ↓
Executes via tools (Read, Bash, Write, etc.)
    ↓
Iterates on failures
    ↓
Returns result

Example:

from anthropic import Anthropic

client = Anthropic()

response = client.messages.create(
    model="claude-sonnet-4-5",
    max_tokens=4096,
    tools=[
        {"type": "computer_use"},
        {"type": "bash"},
        {"type": "file_operations"}
    ],
    messages=[{
        "role": "user",
        "content": "Analyze the last 30 days of sales data and create a summary report"
    }]
)

# Claude autonomously:
# 1. Reads sales data files
# 2. Runs analysis scripts
# 3. Generates report
# 4. Saves to file

Pattern 2: Human-in-the-Loop Approval

Scenario: Agent proposes actions, waits for approval before executing

Flow:

Task → Plan → Show to Human → Approve? → Execute → Result
                                ↓ No
                            Revise Plan

Implementation:

# Step 1: Generate plan
plan_response = client.messages.create(
    model="claude-sonnet-4-5",
    messages=[{
        "role": "user",
        "content": "Create a plan to refactor the authentication system"
    }]
)

# Step 2: Human reviews plan
if human_approves(plan_response.content):
    # Step 3: Execute with tools
    execution_response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=all_tools,
        messages=[{
            "role": "user",
            "content": f"Execute this plan: {plan_response.content}"
        }]
    )

Pattern 3: Iterative Refinement

Scenario: Agent iterates on work based on feedback/errors

Flow:

Attempt 1 → Error → Analyze → Attempt 2 → Error → Analyze → Attempt 3 → Success

Built-in: Claude SDK naturally supports this through computer use - agents can see command outputs and adjust.

Tool Design Best Practices

Custom Tool Creation

Good Tool Design:

# Clear, focused tool
{
    "name": "get_customer_orders",
    "description": "Retrieve all orders for a specific customer ID",
    "input_schema": {
        "type": "object",
        "properties": {
            "customer_id": {
                "type": "string",
                "description": "The unique customer identifier"
            },
            "since_date": {
                "type": "string",
                "description": "ISO date to filter orders from (optional)"
            }
        },
        "required": ["customer_id"]
    }
}

Poor Tool Design:

# Too broad, unclear purpose
{
    "name": "do_customer_stuff",
    "description": "Does various things with customers",
    "input_schema": {
        "type": "object",
        "properties": {
            "action": {"type": "string"},
            "data": {"type": "object"}
        }
    }
}

Tool Selection Principles

DO: ✅ Provide tools relevant to the task ✅ Use clear, descriptive names ✅ Write detailed descriptions (Claude reads these!) ✅ Define strict input schemas ✅ Implement error handling in tools ✅ Return structured, parseable outputs

DON'T: ❌ Give agents tools they don't need (increases confusion) ❌ Use ambiguous names like "handler" or "processor" ❌ Skip input validation ❌ Return raw error messages without context ❌ Make tools with side effects unclear

MCP Integration Patterns

Connecting MCP Servers

# Define MCP server connection
mcp_config = {
    "servers": {
        "github": {
            "command": "npx",
            "args": ["-y", "@modelcontextprotocol/server-github"],
            "env": {
                "GITHUB_TOKEN": os.getenv("GITHUB_TOKEN")
            }
        },
        "postgres": {
            "command": "docker",
            "args": ["run", "mcp-postgres-server"],
            "env": {
                "DATABASE_URL": os.getenv("DATABASE_URL")
            }
        }
    }
}

# Claude automatically discovers tools from MCP servers
response = client.messages.create(
    model="claude-sonnet-4-5",
    mcp_servers=mcp_config,
    messages=[{
        "role": "user",
        "content": "Find all GitHub issues assigned to me and update the project database"
    }]
)
# Claude uses both github and postgres MCP tools

Custom MCP Server

# Create custom MCP server for internal API
from mcp import Server, Tool

server = Server("internal-crm")

@server.tool()
def get_customer_data(customer_id: str):
    """Retrieve customer information from internal CRM"""
    return crm_api.get_customer(customer_id)

@server.tool()
def update_customer_notes(customer_id: str, notes: str):
    """Add notes to customer record"""
    return crm_api.update(customer_id, {"notes": notes})

# Deploy and connect to Claude

Production Best Practices

1. Streaming for UX

Why: Show user progress in real-time, build trust in agent actions

with client.messages.stream(
    model="claude-sonnet-4-5",
    max_tokens=4096,
    tools=tools,
    messages=messages
) as stream:
    for event in stream:
        if event.type == "content_block_delta":
            print(event.delta.text, end="", flush=True)
        elif event.type == "tool_use":
            print(f"\nUsing tool: {event.name}")

2. Error Handling

Robust Error Management:

try:
    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=tools,
        messages=messages
    )
except anthropic.APIError as e:
    # Handle API errors
    log_error(f"API Error: {e}")
    return fallback_response()
except anthropic.RateLimitError:
    # Handle rate limits
    time.sleep(60)
    retry()
except Exception as e:
    # Handle tool execution errors
    log_error(f"Tool Error: {e}")
    return safe_error_message()

3. Cost Optimization

Strategies:

Use Claude Haiku for simple tasks, Sonnet for complex reasoning
Implement caching for repetitive contexts
Batch similar requests when possible
Limit max_tokens appropriately
Monitor token usage via callbacks

Use appropriate model for task

simple_task_response = client.messages.create( model="claude-haiku-4", # Cheaper, faster messages=[{"role": "user", "content": "Format this JSON"}] )

complex_task_response = client.messages.create( model="claude-sonnet-4-5", # More capable messages=[{"role": "user", "content": "Analyze architectural trade-offs"}] )

4. Security

Critical Security Measures:

Tool Permissions:

# Restrict file access
safe_file_tools = {
    "read": {
        "allowed_paths": ["/data/public"],
        "denied_paths": ["/etc", "/secrets"]
    },
    "write": {
        "allowed_paths": ["/output"],
        "denied_paths": ["/"]
    }
}

Input Sanitization:

def sanitize_bash_command(cmd: str) -> str:
    """Prevent dangerous commands"""
    dangerous = ["rm -rf", ":(){ :|:& };:", "dd if="]
    for danger in dangerous:
        if danger in cmd:
            raise SecurityError(f"Dangerous command blocked: {danger}")
    return cmd

Audit Logging:

def log_agent_action(action: dict):
    """Track all agent actions for security audit"""
    audit_log.write({
        "timestamp": datetime.now(),
        "tool": action["tool_name"],
        "input": action["input"],
        "user": action["user_id"],
        "result": action["result"]
    })

Performance Optimization

Parallel Tool Calls

Claude can use multiple tools simultaneously when appropriate:

# Claude automatically parallelizes when possible
response = client.messages.create(
    model="claude-sonnet-4-5",
    tools=[weather_api, stock_api, news_api],
    messages=[{
        "role": "user",
        "content": "Give me weather, stock prices, and news for San Francisco"
    }]
)
# Claude calls all 3 APIs in parallel

Caching Strategies

# Cache system prompts and large contexts
response = client.messages.create(
    model="claude-sonnet-4-5",
    system=[{
        "type": "text",
        "text": large_system_prompt,
        "cache_control": {"type": "ephemeral"}
    }],
    messages=messages
)
# System prompt cached for ~5 minutes

Testing Agents

Unit Testing Tools

def test_customer_lookup_tool():
    """Test individual tool behavior"""
    result = get_customer_orders("CUST123")
    assert result["customer_id"] == "CUST123"
    assert isinstance(result["orders"], list)

Integration Testing

def test_agent_workflow():
    """Test agent using multiple tools"""
    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=[tool1, tool2, tool3],
        messages=[{
            "role": "user",
            "content": "Process order #12345"
        }]
    )

    # Verify expected tool usage
    tool_calls = extract_tool_calls(response)
    assert "verify_order" in tool_calls
    assert "process_payment" in tool_calls

Evaluation Framework

# Use Claude's built-in evaluation
from anthropic import Anthropic

eval_client = Anthropic()

eval_results = eval_client.evaluate(
    agent=my_agent,
    test_cases=[
        {"input": "...", "expected_output": "..."},
        # More test cases
    ],
    metrics=["accuracy", "latency", "tool_efficiency"]
)

Common Patterns

Pattern: Multi-Step Research

async def research_agent(query: str):
    """Agent researches topic using multiple sources"""
    response = await client.messages.create(
        model="claude-sonnet-4-5",
        tools=[web_search, web_fetch, summarize],
        messages=[{
            "role": "user",
            "content": f"Research '{query}' and provide comprehensive summary"
        }]
    )
    # Claude: searches → fetches articles → summarizes → synthesizes
    return response.content

Pattern: Code Generation & Testing

def code_agent(requirements: str):
    """Agent writes and tests code"""
    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=[write_file, bash, read_file],
        messages=[{
            "role": "user",
            "content": f"Write and test code for: {requirements}"
        }]
    )
    # Claude: writes code → saves file → runs tests → fixes errors → retries
    return response.content

Pattern: Data Pipeline

def data_pipeline_agent(source: str, destination: str):
    """Agent ETL pipeline"""
    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=[read_file, bash, postgres_insert],
        messages=[{
            "role": "user",
            "content": f"Extract data from {source}, transform it, and load to {destination}"
        }]
    )
    # Claude orchestrates full ETL
    return response.content

Model Selection

Claude Sonnet 4.5 (claude-sonnet-4-5)

Best For:

Complex reasoning and analysis
Multi-step autonomous tasks
Code generation and debugging
Research and synthesis
High-stakes decisions

Characteristics:

Highest capability
Best for computer use
More expensive
Slower than Haiku

Claude Haiku 4 (claude-haiku-4)

Best For:

Simple, well-defined tasks
Format conversions
Quick classifications
High-throughput scenarios
Cost-sensitive applications

Characteristics:

Fast responses
Lower cost
Good for structured tasks
Limited complex reasoning

Integration Examples

With FastAPI

from fastapi import FastAPI
from anthropic import Anthropic

app = FastAPI()
client = Anthropic()

@app.post("/agent/task")
async def run_agent_task(task: dict):
    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=load_tools_for_task(task),
        messages=[{
            "role": "user",
            "content": task["description"]
        }]
    )
    return {"result": response.content}

With LangChain (via LangChain-Anthropic)

from langchain_anthropic import ChatAnthropic
from langchain.agents import initialize_agent

llm = ChatAnthropic(model="claude-sonnet-4-5")
agent = initialize_agent(
    tools=[tool1, tool2],
    llm=llm,
    agent_type="structured-chat-zero-shot-react-description"
)
result = agent.run("Complete this task")

Monitoring & Observability

Key Metrics

Tool Call Success Rate - % of tool invocations that succeed
Task Completion Rate - % of user requests fully resolved
Average Iterations - How many tool calls per task
Latency - Time to complete requests
Token Usage - Input + output tokens per request
Error Rate - % of requests with errors

Logging Best Practices

import logging

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger("claude-agent")

def run_agent_with_logging(task):
    logger.info(f"Starting task: {task}")

    response = client.messages.create(
        model="claude-sonnet-4-5",
        tools=tools,
        messages=[{"role": "user", "content": task}]
    )

    logger.info(f"Tools used: {extract_tools(response)}")
    logger.info(f"Token usage: {response.usage}")

    return response

Decision Framework

Use Claude SDK when:

Building on Anthropic models (Claude family)
Need computer use capabilities (file, bash, iteration)
Want production-ready agent framework
Require MCP integration for data sources
Building autonomous task completion agents

Consider alternatives when:

Committed to OpenAI ecosystem (use AgentKit)
Need visual agent builder (use AgentKit)
Require complex state machines (use LangGraph)
Want full OSS control (use AutoGen/LangGraph)

Resources

Official Documentation:

Agent SDK Docs: https://docs.claude.com/en/api/agent-sdk
Computer Use Guide: https://docs.anthropic.com/en/docs/agents/computer-use
MCP Integration: https://modelcontextprotocol.io

GitHub:

Python SDK: https://github.com/anthropics/claude-agent-sdk-python
TypeScript SDK: https://github.com/anthropics/claude-agent-sdk-typescript

Final Principles

Computer Use is Game-Changing - Leverage file/bash capabilities fully
Tools are First-Class - Design tools as carefully as prompts
MCP for Data - Use MCP servers for enterprise data connectivity
Stream for UX - Real-time feedback builds user trust
Security Always - Validate inputs, restrict permissions, audit actions
Right Model for Task - Haiku for simple, Sonnet for complex

This skill ensures you build powerful, autonomous agents using Claude's cutting-edge capabilities in 2025.

Weekly Installs

–

Repository

frankxai/claude…-library

GitHub Stars

First Seen

–

Security Audits

Gen Agent Trust HubFail SocketPass SnykWarn

agent-browser 浏览器自动化工具 - Vercel Labs 命令行网页操作与测试

147,400 周安装

Claude Agent SDK 专家指南：构建自主AI代理，实现计算机控制与MCP集成

🇨🇳中文介绍

Claude SDK 专家技能

目的

SDK 概述

Claude Agent SDK (2025)

核心能力

1. 计算机使用

2. 内置工具

相关 Skills

3. MCP 集成

架构模式

模式 1：自主任务完成

模式 2：人工介入审批

模式 3：迭代优化

工具设计最佳实践

自定义工具创建

工具选择原则

MCP 集成模式

连接 MCP 服务器

自定义 MCP 服务器

生产最佳实践

1. 流式传输以提升用户体验

2. 错误处理

3. 成本优化

Use appropriate model for task

4. 安全性

性能优化

并行工具调用

缓存策略

测试代理

单元测试工具

集成测试

评估框架

常见模式

模式：多步骤研究

模式：代码生成与测试

模式：数据管道

模型选择

Claude Sonnet 4.5 (claude-sonnet-4-5)

Claude Haiku 4 (claude-haiku-4)

集成示例

与 FastAPI 集成

与 LangChain 集成（通过 LangChain-Anthropic）

监控与可观测性

关键指标

日志记录最佳实践

决策框架

资源

最终原则

🇺🇸English

Claude SDK Expert Skill

Purpose

SDK Overview

Claude Agent SDK (2025)

Key Capabilities

1. Computer Use

2. Built-in Tools

3. MCP Integration

Architecture Patterns

Pattern 1: Autonomous Task Completion

Pattern 2: Human-in-the-Loop Approval

Pattern 3: Iterative Refinement

Tool Design Best Practices

Custom Tool Creation

Tool Selection Principles

MCP Integration Patterns

Connecting MCP Servers

Custom MCP Server

Production Best Practices

1. Streaming for UX

2. Error Handling

3. Cost Optimization

Use appropriate model for task

4. Security

Performance Optimization

Parallel Tool Calls

Caching Strategies

Testing Agents

Unit Testing Tools