S
SkillsMD
首页Skills怎么安装Skills →
S
SkillsMD

发现、学习和掌握最新的 AI 技术 Skills。基于真实社区数据,为开发者提供最权威的 AI 工具导航。

导航

  • 首页
  • Skills

关于

  • 聚焦 AI 技术 Skills
  • 每周数据更新
  • 中英双语文档

© 2026 SkillsMD. All rights reserved.

闽ICP备10022555号-10

Built with Next.js · Tailwind CSS · MySQL

Skills 列表

探索所有 AI 技术 Skills共 9 个

分类

排序:
9 个结果
Skillverl-rl-traininggithub.com

verl-rl-training:字节跳动开源大语言模型强化学习训练库,支持PPO/GRPO/DAPO等算法

verl-rl-training by orchestra-research/ai-research-skills

Stars5.6k
周安装
69/周

Install Command

CLI
npx skills add https://github.com/orchestra-research/ai-research-skills --skill verl-rl-training

查看详情、安装说明与相关 Skills

查看详情
Skillslime-rl-traininggithub.com

Slime-RL-Training:清华大学THUDM开发的强化学习LLM后训练框架,支持GLM/Qwen3/DeepSeek/Llama

slime-rl-training by orchestra-research/ai-research-skills

Stars5.7k周安装69/周

Install Command

CLI
npx skills add https://github.com/orchestra-research/ai-research-skills --skill slime-rl-training

查看详情、安装说明与相关 Skills

查看详情
Skillgrpo-rl-traininggithub.com

GRPO强化学习训练指南:使用TRL库微调语言模型,优化输出格式与推理能力

grpo-rl-training by orchestra-research/ai-research-skills

Stars5.6k周安装68/周

Install Command

CLI
npx skills add https://github.com/orchestra-research/ai-research-skills --skill grpo-rl-training

查看详情、安装说明与相关 Skills

查看详情
Skillmiles-rl-traininggithub.com

miles-rl-training:企业级强化学习框架,支持大规模MoE模型FP8/INT4训练

miles-rl-training by orchestra-research/ai-research-skills

Stars5.5k周安装65/周

Install Command

CLI
npx skills add https://github.com/orchestra-research/ai-research-skills --skill miles-rl-training

查看详情、安装说明与相关 Skills

查看详情
Skillopenrlhf-traininggithub.com

OpenRLHF高性能RLHF训练框架:基于Ray的分布式强化学习人类反馈优化

openrlhf-training by orchestra-research/ai-research-skills

Stars5.5k周安装65/周

Install Command

CLI
npx skills add https://github.com/orchestra-research/ai-research-skills --skill openrlhf-training

查看详情、安装说明与相关 Skills

查看详情
Skillconstitutional-aigithub.com

宪法式人工智能(Constitutional AI)原理与实践:基于AI反馈的无害性训练方法

constitutional-ai by orchestra-research/ai-research-skills

Stars5.5k周安装64/周

Install Command

CLI
npx skills add https://github.com/orchestra-research/ai-research-skills --skill constitutional-ai

查看详情、安装说明与相关 Skills

查看详情
Skilltorchforge-rl-traininggithub.com

torchforge:PyTorch原生强化学习库,实现快速RL算法实验与分布式训练

torchforge-rl-training by orchestra-research/ai-research-skills

Stars5.5k周安装64/周

Install Command

CLI
npx skills add https://github.com/orchestra-research/ai-research-skills --skill torchforge-rl-training

查看详情、安装说明与相关 Skills

查看详情
Skillfine-tuning-with-trlgithub.com

TRL 强化学习微调指南:SFT、DPO、PPO 完整流程与代码示例

fine-tuning-with-trl by orchestra-research/ai-research-skills

Stars5.5k周安装63/周

Install Command

CLI
npx skills add https://github.com/orchestra-research/ai-research-skills --skill fine-tuning-with-trl

查看详情、安装说明与相关 Skills

查看详情
Skillmiles-rl-traininggithub.com

miles-rl-training:企业级强化学习框架,支持大规模MoE模型FP8/INT4量化训练

miles-rl-training by davila7/claude-code-templates

Stars2.4万周安装62/周

Install Command

CLI
npx skills add https://github.com/davila7/claude-code-templates --skill miles-rl-training

查看详情、安装说明与相关 Skills

查看详情

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者,精准高效

联系我们