高级数据工程师技能：生产级AI/ML数据系统架构、MLOps与性能优化

senior-data-engineer by davila7/claude-code-templates

763 周安装量

23,500 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/davila7/claude-code-templates --skill senior-data-engineer

AI/机器学习数据分析开发运维

🇨🇳中文介绍

高级数据工程师

面向生产级 AI/ML/数据系统的世界级高级数据工程师技能。

快速开始

核心能力

# Core Tool 1
python scripts/pipeline_orchestrator.py --input data/ --output results/

# Core Tool 2  
python scripts/data_quality_validator.py --target project/ --analyze

# Core Tool 3
python scripts/etl_performance_optimizer.py --config config.yaml --deploy

核心专长

此技能涵盖以下世界级能力：

先进的生产模式和架构
可扩展的系统设计与实现
大规模性能优化
MLOps 和 DataOps 最佳实践
实时处理与推理
分布式计算框架
模型部署与监控
安全与合规
成本优化
团队领导与指导

技术栈

编程语言： Python, SQL, R, Scala, Go 机器学习框架： PyTorch, TensorFlow, Scikit-learn, XGBoost 数据工具： Spark, Airflow, dbt, Kafka, Databricks 大语言模型框架： LangChain, LlamaIndex, DSPy 部署： Docker, Kubernetes, AWS/GCP/Azure 监控： MLflow, Weights & Biases, Prometheus 数据库： PostgreSQL, BigQuery, Snowflake, Pinecone

参考文档

1. 数据管道架构

完整指南位于 references/data_pipeline_architecture.md，涵盖：

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

2. 数据建模模式

完整工作流文档位于 references/data_modeling_patterns.md，包括：

分步流程
架构设计模式
工具集成指南
性能调优策略
故障排除流程

3. DataOps 最佳实践

技术参考指南位于 references/dataops_best_practices.md，包含：

系统设计原则
实施示例
配置最佳实践
部署策略
监控与可观测性

模式 1：可扩展数据处理

基于分布式计算的企业级数据处理：

水平扩展架构
容错设计
实时与批处理
数据质量验证
性能监控

模式 2：机器学习模型部署

具备高可用性的生产级机器学习系统：

低延迟模型服务
A/B 测试基础设施
特征存储集成
模型监控与漂移检测
自动化重训练管道

模式 3：实时推理

高吞吐量推理系统：

批处理与缓存策略
负载均衡
自动扩缩容
延迟优化
成本优化

测试驱动开发
代码审查与结对编程
文档即代码
版本控制一切
持续集成

监控所有关键环节
自动化部署
使用特性标志发布
金丝雀部署
全面的日志记录

指导初级工程师
推动技术决策
建立编码标准
培养学习文化
跨职能协作

P50: < 50ms
P95: < 100ms
P99: < 200ms

请求/秒: > 1000
并发用户: > 10,000

正常运行时间: 99.9%
错误率: < 0.1%

认证与授权
数据加密（静态与传输中）
PII 处理与匿名化
GDPR/CCPA 合规
定期安全审计
漏洞管理

# Development
python -m pytest tests/ -v --cov
python -m black src/
python -m pylint src/

# Training
python scripts/train.py --config prod.yaml
python scripts/evaluate.py --model best.pth

# Deployment
docker build -t service:v1 .
kubectl apply -f k8s/
helm upgrade service ./charts/

# Monitoring
kubectl logs -f deployment/service
python scripts/health_check.py

高级模式：references/data_pipeline_architecture.md
实施指南：references/data_modeling_patterns.md
技术参考：references/dataops_best_practices.md
自动化脚本：scripts/ 目录

作为世界级的高级专业人士：

技术领导力
- 驱动架构决策
- 指导团队成员
- 建立最佳实践
- 确保代码质量
战略思维
- 与业务目标对齐
- 评估权衡取舍
- 为扩展进行规划
- 管理技术债务
协作
- 跨团队工作
- 有效沟通
- 建立共识
- 分享知识
创新
- 紧跟研究前沿
- 尝试新方法
- 为社区做贡献
- 推动持续改进
生产卓越性
- 确保高可用性
- 主动监控
- 优化性能
- 响应事件

🇺🇸English

Senior Data Engineer

World-class senior data engineer skill for production-grade AI/ML/Data systems.

Quick Start

Main Capabilities

# Core Tool 1
python scripts/pipeline_orchestrator.py --input data/ --output results/

# Core Tool 2  
python scripts/data_quality_validator.py --target project/ --analyze

# Core Tool 3
python scripts/etl_performance_optimizer.py --config config.yaml --deploy

Core Expertise

This skill covers world-class capabilities in:

Advanced production patterns and architectures
Scalable system design and implementation
Performance optimization at scale
MLOps and DataOps best practices
Real-time processing and inference
Distributed computing frameworks
Model deployment and monitoring
Security and compliance
Cost optimization
Team leadership and mentoring

Tech Stack

Languages: Python, SQL, R, Scala, Go ML Frameworks: PyTorch, TensorFlow, Scikit-learn, XGBoost Data Tools: Spark, Airflow, dbt, Kafka, Databricks LLM Frameworks: LangChain, LlamaIndex, DSPy Deployment: Docker, Kubernetes, AWS/GCP/Azure Monitoring: MLflow, Weights & Biases, Prometheus Databases: PostgreSQL, BigQuery, Snowflake, Pinecone

Reference Documentation

1. Data Pipeline Architecture

Comprehensive guide available in references/data_pipeline_architecture.md covering:

Advanced patterns and best practices
Production implementation strategies
Performance optimization techniques
Scalability considerations
Security and compliance
Real-world case studies

2. Data Modeling Patterns

Complete workflow documentation in references/data_modeling_patterns.md including:

Step-by-step processes
Architecture design patterns
Tool integration guides
Performance tuning strategies
Troubleshooting procedures

3. Dataops Best Practices

Technical reference guide in references/dataops_best_practices.md with:

System design principles
Implementation examples
Configuration best practices
Deployment strategies
Monitoring and observability

Production Patterns

Pattern 1: Scalable Data Processing

Enterprise-scale data processing with distributed computing:

Horizontal scaling architecture
Fault-tolerant design
Real-time and batch processing
Data quality validation
Performance monitoring

Pattern 2: ML Model Deployment

Production ML system with high availability:

Model serving with low latency
A/B testing infrastructure
Feature store integration
Model monitoring and drift detection
Automated retraining pipelines

Pattern 3: Real-Time Inference

High-throughput inference system:

Batching and caching strategies
Load balancing
Auto-scaling
Latency optimization
Cost optimization

Best Practices

Development

Test-driven development
Code reviews and pair programming
Documentation as code
Version control everything
Continuous integration

Production

Monitor everything critical
Automate deployments
Feature flags for releases
Canary deployments
Comprehensive logging

Team Leadership

Mentor junior engineers
Drive technical decisions
Establish coding standards
Foster learning culture
Cross-functional collaboration

Performance Targets

Latency:

P50: < 50ms
P95: < 100ms
P99: < 200ms

Throughput:

Requests/second: > 1000
Concurrent users: > 10,000

Availability:

Uptime: 99.9%
Error rate: < 0.1%

Security & Compliance

Authentication & authorization
Data encryption (at rest & in transit)
PII handling and anonymization
GDPR/CCPA compliance
Regular security audits
Vulnerability management

Common Commands

# Development
python -m pytest tests/ -v --cov
python -m black src/
python -m pylint src/

# Training
python scripts/train.py --config prod.yaml
python scripts/evaluate.py --model best.pth

# Deployment
docker build -t service:v1 .
kubectl apply -f k8s/
helm upgrade service ./charts/

# Monitoring
kubectl logs -f deployment/service
python scripts/health_check.py

Resources

Advanced Patterns: references/data_pipeline_architecture.md
Implementation Guide: references/data_modeling_patterns.md
Technical Reference: references/dataops_best_practices.md
Automation Scripts: scripts/ directory

Senior-Level Responsibilities

As a world-class senior professional:

Technical Leadership
- Drive architectural decisions
- Mentor team members
- Establish best practices
- Ensure code quality
Strategic Thinking
- Align with business goals
- Evaluate trade-offs
- Plan for scale
- Manage technical debt
Collaboration
- Work across teams
- Communicate effectively
- Build consensus
- Share knowledge
Innovation
- Stay current with research
- Experiment with new approaches
- Contribute to community
- Drive continuous improvement
Production Excellence
- Ensure high availability
- Monitor proactively
- Optimize performance
- Respond to incidents

Weekly Installs

752

Repository

davila7/claude-…emplates

GitHub Stars

23.4K

First Seen

Jan 20, 2026

Security Audits

Gen Agent Trust HubPass SocketPass SnykPass

Installed on

opencode596

codex583

gemini-cli562

github-copilot512

cursor497

claude-code496

Azure Data Explorer (Kusto) 查询技能：KQL数据分析、日志遥测与时间序列处理

98,500 周安装