ppt-generation by bytedance/deer-flow
npx skills add https://github.com/bytedance/deer-flow --skill ppt-generation此技能通过为每张幻灯片创建 AI 生成的图像并将其组合成 PPTX 文件,来生成专业的 PowerPoint 演示文稿。工作流程包括:使用一致的视觉风格规划演示文稿结构,按顺序生成幻灯片图像(使用前一张幻灯片作为参考以保持风格一致性),并将其组装成最终的演示文稿。
创建演示文稿计划时,请选择以下风格之一:
| 风格 | 描述 | 最佳适用场景 |
|---|---|---|
| glassmorphism | 具有模糊效果的磨砂玻璃面板,漂浮的半透明卡片,鲜艳的渐变背景,通过分层实现深度感 | 科技产品、AI/SaaS 演示、未来主义推介 |
| dark-premium | 浓郁的黑色背景 (#0a0a0a),明亮的强调色,微妙的发光效果,奢侈品牌美学 | 高端产品、高管演示、高端品牌 |
| gradient-modern | 大胆的网格渐变,流畅的色彩过渡,现代字体排印,鲜艳而精致 | 初创公司、创意机构、品牌发布 |
| neo-brutalist | 原始粗犷的字体排印,高对比度,刻意的"丑陋"美学,反设计即设计,孟菲斯风格启发 | 前卫品牌、面向 Z 世代、颠覆性初创公司 |
广告位招租
在这里展示您的产品或服务
触达数万 AI 开发者,精准高效
| 3d-isometric | 简洁的等距插图,漂浮的 3D 元素,柔和的阴影,科技感美学 | 技术讲解、产品特性、SaaS 演示 |
| editorial | 杂志级版面设计,精致的字体排印层次,戏剧性的摄影,Vogue/Bloomberg 美学 | 年度报告、奢侈品牌、思想领导力 |
| minimal-swiss | 基于网格的精确性,Helvetica 风格的字体排印,大胆使用负空间,永恒的现代主义 | 建筑、设计公司、高端咨询 |
| keynote | Apple 风格美学,采用大胆的字体排印、戏剧性的图像、高对比度、电影感 | 主题演讲、产品发布、鼓舞人心的演讲 |
当用户请求生成演示文稿时,请确定:
/mnt/user-data 下的文件夹在 /mnt/user-data/workspace/ 中创建一个包含演示文稿结构的 JSON 文件。重要:包含 style 字段以定义整体视觉一致性。
{
"title": "演示文稿标题",
"style": "keynote",
"style_guidelines": {
"color_palette": "深黑色背景,白色文字,单一强调色(蓝色或橙色)",
"typography": "粗体无衬线标题,简洁的正文文本,戏剧性的尺寸对比",
"imagery": "高质量摄影,全出血图像,电影感构图",
"layout": "充足的留白,居中对焦,每张幻灯片元素极简"
},
"aspect_ratio": "16:9",
"slides": [
{
"slide_number": 1,
"type": "title",
"title": "主标题",
"subtitle": "副标题或标语",
"visual_description": "用于图像生成的详细描述"
},
{
"slide_number": 2,
"type": "content",
"title": "幻灯片标题",
"key_points": ["要点 1", "要点 2", "要点 3"],
"visual_description": "用于图像生成的详细描述"
}
]
}
重要:严格按顺序一张一张地生成幻灯片。请勿并行或批量生成图像。每张幻灯片都依赖于前一张幻灯片的输出作为参考图像。并行生成幻灯片会破坏视觉一致性,这是不允许的。
阅读图像生成技能:/mnt/skills/public/image-generation/SKILL.md
对于第一张幻灯片(幻灯片 1),创建一个用于确立视觉风格的提示:
{
"prompt": "专业演示文稿幻灯片。[计划中的 style_guidelines]。标题:'您的标题'。[visual_description]。此幻灯片为整个演示文稿确立了视觉语言。",
"style": "[基于所选风格 - 例如,Apple Keynote 美学,戏剧性灯光,电影感]",
"composition": "具有清晰文本层次的简洁布局,[特定风格的构图]",
"color_palette": "[来自 style_guidelines]",
"typography": "[来自 style_guidelines]"
}
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/slide-01-prompt.json \
--output-file /mnt/user-data/outputs/slide-01.jpg \
--aspect-ratio 16:9
{
"prompt": "延续参考图像视觉风格的专业演示文稿幻灯片。保持相同的调色板、字体排印风格和整体美学。标题:'幻灯片标题'。[visual_description]。保持与参考图像的视觉一致性。",
"style": "与参考图像的风格完全匹配",
"composition": "与参考图像相似的布局原则,根据此内容进行调整",
"color_palette": "与参考图像相同",
"consistency_note": "此幻灯片必须看起来与参考图像属于同一演示文稿"
}
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/slide-02-prompt.json \
--reference-images /mnt/user-data/outputs/slide-01.jpg \
--output-file /mnt/user-data/outputs/slide-02.jpg \
--aspect-ratio 16:9
# 幻灯片 3 参考幻灯片 2
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/slide-03-prompt.json \
--reference-images /mnt/user-data/outputs/slide-02.jpg \
--output-file /mnt/user-data/outputs/slide-03.jpg \
--aspect-ratio 16:9
# 幻灯片 4 参考幻灯片 3
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/slide-04-prompt.json \
--reference-images /mnt/user-data/outputs/slide-03.jpg \
--output-file /mnt/user-data/outputs/slide-04.jpg \
--aspect-ratio 16:9
所有幻灯片图像生成完成后,调用组合脚本:
python /mnt/skills/public/ppt-generation/scripts/generate.py \
--plan-file /mnt/user-data/workspace/presentation-plan.json \
--slide-images /mnt/user-data/outputs/slide-01.jpg /mnt/user-data/outputs/slide-02.jpg /mnt/user-data/outputs/slide-03.jpg \
--output-file /mnt/user-data/outputs/presentation.pptx
参数:
--plan-file:演示文稿计划 JSON 文件的绝对路径(必需)--slide-images:按顺序排列的幻灯片图像的绝对路径(必需,以空格分隔)--output-file:输出 PPTX 文件的绝对路径(必需)[!注意] 请勿读取 python 文件,只需使用参数调用它。
用户请求:"创建一个关于 AI 产品发布的演示文稿"
创建 /mnt/user-data/workspace/ai-product-plan.json:
{
"title": "Introducing Nova AI",
"style": "glassmorphism",
"style_guidelines": {
"color_palette": "鲜艳的紫色到青色渐变背景 (#667eea→#00d4ff),具有 15-20% 白色不透明度的磨砂玻璃面板,电光强调色",
"typography": "SF Pro Display 风格,带有微妙文本阴影的粗体 700 字重白色标题,干净的 400 字重正文文本,在玻璃上具有出色的对比度",
"imagery": "抽象的 3D 玻璃球体,漂浮的半透明几何形状,柔和发光的球体,通过分层透明度实现深度感",
"layout": "居中的磨砂玻璃卡片,圆角 32px,内边距 48-64px,漂浮在渐变背景之上,通过柔和阴影实现分层深度",
"effects": "玻璃面板的背景模糊 20-40px,微妙的白色边框发光,与渐变匹配的柔和彩色阴影,光线折射效果",
"visual_language": "Apple Vision Pro / visionOS 美学,通过透明度实现高级深度感,未来主义且平易近人,2024 年设计趋势"
},
"aspect_ratio": "16:9",
"slides": [
{
"slide_number": 1,
"type": "title",
"title": "Introducing Nova AI",
"subtitle": "Intelligence, Reimagined",
"visual_description": "令人惊叹的渐变背景,从深紫色 (#667eea) 经过洋红色流向青色 (#00d4ff)。中央:具有强烈背景模糊效果的大型磨砂玻璃面板,包含粗体白色标题 'Introducing Nova AI' 和较浅的副标题。卡片周围漂浮着 3D 玻璃球体和抽象形状,营造深度感。玻璃面板后方散发出柔和的光芒。高级 visionOS 美学。玻璃卡片具有微妙的白色边框 (1px rgba 255,255,255,0.3) 和柔和的紫色调阴影。"
},
{
"slide_number": 2,
"type": "content",
"title": "Why Nova?",
"key_points": ["10x faster processing", "Human-like understanding", "Enterprise-grade security"],
"visual_description": "相同的紫色-青色渐变背景。左侧:漂浮的磨砂玻璃卡片,标题为粗体白色的 'Why Nova?',下方是三个要点,带有微妙的玻璃药丸状徽章。右侧:神经网络的抽象 3D 可视化,呈现为相互连接的玻璃节点,带有柔和发光效果。漂浮的半透明几何形状(二十面体、圆环)增加了深度感。与上一张幻灯片一致的玻璃态美学。"
},
{
"slide_number": 3,
"type": "content",
"title": "How It Works",
"key_points": ["Natural language input", "Multi-modal processing", "Instant insights"],
"visual_description": "与之前幻灯片一致的渐变背景。中央构图:三个以轻微角度堆叠的磨砂玻璃卡片,展示工作流程步骤,由柔和的发光线条连接。每个卡片都有一个抽象图标。构图周围漂浮着玻璃球体和光粒子。顶部是粗体白色的标题 'How It Works'。通过卡片分层和透明度营造深度感。"
},
{
"slide_number": 4,
"type": "content",
"title": "Built for Scale",
"key_points": ["1M+ concurrent users", "99.99% uptime", "Global infrastructure"],
"visual_description": "相同的渐变背景。不对称布局:右侧是大型磨砂玻璃面板,以粗体字体显示指标。左侧:由玻璃面板和连接线构成的抽象 3D 地球仪,代表全球规模。漂浮的数据可视化元素作为带有数字的小型玻璃卡片。整体呈现柔和的氛围光。高级科技美学。"
},
{
"slide_number": 5,
"type": "conclusion",
"title": "The Future Starts Now",
"subtitle": "Join the waitlist",
"visual_description": "戏剧性的结尾幻灯片。渐变背景,饱和度略有增加。中央磨砂玻璃卡片带有粗体标题 'The Future Starts Now' 和行动号召副标题。卡片后方:柔和光线射线和漂浮玻璃粒子的爆发,营造庆祝效果。多个分层的玻璃形状营造深度感。最具视觉冲击力的幻灯片,同时保持风格一致性。"
}
]
}
阅读 /mnt/skills/public/image-generation/SKILL.md 以了解如何生成图像。
幻灯片 1 - 标题(确立视觉语言):
创建 /mnt/user-data/workspace/nova-slide-01.json:
{
"prompt": "采用玻璃态设计的超高级演示文稿标题幻灯片。背景:从深紫色 (#667eea) 经过洋红色 (#f093fb) 到青色 (#00d4ff) 的平滑流动渐变,柔和而鲜艳。中央:具有强烈背景模糊效果的大型磨砂玻璃面板,圆角 32px,包含粗体白色无衬线标题 'Introducing Nova AI'(72pt,SF Pro Display 风格,字重 700),带有微妙的文本阴影,下方是较轻字重的副标题 'Intelligence, Reimagined'。玻璃面板具有微妙的白色边框 (1px rgba 255,255,255,0.25) 和柔和的紫色调投影。卡片周围漂浮:带有折射效果的 3D 玻璃球体,半透明几何形状(二十面体、抽象斑点),营造深度和维度感。玻璃面板后方散发出柔和的发光效果。微小的漂浮光粒子。Apple Vision Pro / visionOS UI 美学。专业演示文稿幻灯片,16:9 宽高比。超现代、高级科技产品发布感。",
"style": "Glassmorphism, visionOS 美学, Apple Vision Pro UI 风格, 高级科技, 2024 年设计趋势",
"composition": "居中的玻璃卡片作为焦点,漂浮的 3D 元素在边缘营造深度,40% 负空间,清晰的视觉层次",
"lighting": "来自渐变的柔和环境光,通过玻璃元素的光线折射,3D 形状上微妙的边缘光",
"color_palette": "紫色渐变 #667eea, 洋红色 #f093fb, 青色 #00d4ff, 磨砂白色 rgba(255,255,255,0.15), 纯白色文本 #ffffff",
"effects": "玻璃面板的背景模糊,带有色彩色调的柔和投影,光线折射,玻璃上微妙的噪点纹理,漂浮粒子"
}
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/nova-slide-01.json \
--output-file /mnt/user-data/outputs/nova-slide-01.jpg \
--aspect-ratio 16:9
幻灯片 2 - 内容(必须参考幻灯片 1 以保持一致性):
创建 /mnt/user-data/workspace/nova-slide-02.json:
{
"prompt": "演示文稿幻灯片,延续参考图像的**精确**视觉风格。**相同**的紫色到青色渐变背景,**相同**的玻璃态美学,**相同**的字体排印风格。左侧:具有背景模糊效果的磨砂玻璃卡片,包含粗体白色标题 'Why Nova?'(匹配参考字体风格),下方是三个作为微妙玻璃药丸状徽章的特性要点。右侧:由相互连接的玻璃节点构成的抽象 3D 神经网络可视化,带有柔和的青色发光效果,漂浮在空间中。漂浮的半透明几何形状(匹配参考风格)增加了深度感。磨砂玻璃具有相同的处理方式:白色边框,紫色调阴影,相同的模糊强度。**关键**:此幻灯片必须看起来与参考图像属于**完全相同的**演示文稿 - 相同的颜色,相同的玻璃处理方式,相同的整体美学。",
"style": "**精确匹配参考** - Glassmorphism, visionOS 美学, 相同的视觉语言",
"composition": "不对称分割:左侧玻璃卡片 (40%),右侧 3D 可视化 (40%),元素之间有呼吸空间",
"color_palette": "**精确匹配参考**:紫色 #667eea, 青色 #00d4ff 渐变,相同的磨砂白色处理,相同的文本白色",
"consistency_note": "**关键**:必须在风格上与参考图像视觉上完全相同。相同的渐变颜色,相同的玻璃模糊强度,相同的阴影处理,相同的字体字重和风格。观看者应能立即认出这是同一演示文稿。"
}
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/nova-slide-02.json \
--reference-images /mnt/user-data/outputs/nova-slide-01.jpg \
--output-file /mnt/user-data/outputs/nova-slide-02.jpg \
--aspect-ratio 16:9
幻灯片 3-5:继续相同的模式,每张都参考前一张幻灯片
后续幻灯片的关键一致性规则:
consistency_notepython /mnt/skills/public/ppt-generation/scripts/generate.py \
--plan-file /mnt/user-data/workspace/nova-plan.json \
--slide-images /mnt/user-data/outputs/nova-slide-01.jpg /mnt/user-data/outputs/nova-slide-02.jpg /mnt/user-data/outputs/nova-slide-03.jpg /mnt/user-data/outputs/nova-slide-04.jpg /mnt/user-data/outputs/nova-slide-05.jpg \
--output-file /mnt/user-data/outputs/nova-presentation.pptx
{
"style": "glassmorphism",
"style_guidelines": {
"color_palette": "鲜艳的渐变背景(紫色 #667eea 到粉色 #f093fb,或青色 #4facfe 到蓝色 #00f2fe),具有 20% 不透明度的磨砂白色面板,在渐变背景下突出的强调色",
"typography": "SF Pro Display 或 Inter 字体风格,粗体 600-700 字重标题,干净的 400 字重正文,白色文本带有微妙的投影以在玻璃上保持可读性",
"imagery": "漂浮在空间中的抽象 3D 形状,柔和的模糊球体,具有玻璃材质的几何基本体,通过重叠的半透明层实现深度感",
"layout": "具有背景模糊效果的漂浮卡片面板,充足的内边距 (48-64px),圆角 (24-32px 半径),通过微妙的阴影实现分层深度",
"effects": "磨砂玻璃模糊 (backdrop-filter: blur 20px),微妙的白色边框 (1px rgba 255,255,255,0.2),面板后方的柔和发光,带有投影的漂浮元素",
"visual_language": "类似 Apple Vision Pro UI 的高级科技美学,通过透明度实现深度感,光线穿过玻璃表面折射"
}
}
{
"style": "dark-premium",
"style_guidelines": {
"color_palette": "深黑色基调 (#0a0a0a 到 #121212),明亮的强调色(电光蓝 #00d4ff,霓虹紫 #bf5af2,或金色 #ffd700),用于深度的微妙灰色渐变 (#1a1a1a 到 #0a0a0a)",
"typography": "优雅的无衬线字体(Neue Haas Grotesk 或 Suisse Int'l 风格),戏剧性的尺寸对比 (72pt+ 标题,18pt 正文),标题字母间距 -0.02em,纯白色 (#ffffff) 文本",
"imagery": "戏剧性的影棚灯光,边缘光和发光效果,电影感产品拍摄,抽象光迹,高级材质纹理(拉丝金属,哑光表面)",
"layout": "充足的负空间 (60%+),不对称平衡,内容锚定在网格上但留有呼吸空间,每张幻灯片单一焦点",
"effects": "关键元素后方的微妙环境光,光晕效果,颗粒纹理叠加 (2-3% 不透明度),边缘暗角",
"visual_language": "奢侈科技品牌美学(Bang & Olufsen, Porsche Design),通过克制体现精致感,每个元素都经过深思熟虑"
}
}
{
"style": "gradient-modern",
"style_guidelines": {
"color_palette": "大胆的网格渐变(Stripe/Linear 风格:紫-粉-橙 #7c3aed→#ec4899→#f97316,或冷色调:青-蓝-紫 #06b6d4→#3b82f6→#8b5cf6),根据背景强度使用白色或深色文本",
"typography": "现代几何无衬线字体(Satoshi, General Sans, 或 Clash Display 风格),可变字体字重,超大粗体标题 (80pt+),舒适的正文文本 (20pt)",
"imagery": "抽象流体形状,变形渐变,3D 渲染的抽象对象,柔和的有机形态,漂浮的几何基本体",
"layout": "动态的不对称构图,具有混合模式的重叠元素,与渐变流集成的文本,全出血背景",
"effects": "平滑的渐变过渡,微妙的噪点纹理 (3-5% 用于深度),与渐变匹配的色彩色调柔和阴影,暗示运动的运动模糊",
"visual_language": "当代 SaaS 美学(Stripe, Linear, Vercel),充满活力且专业,前瞻性的科技氛围"
}
}
{
"style": "neo-brutalist",
"style_guidelines": {
"color_palette": "高对比度原色:鲜明的黑色,纯白色,搭配大胆的强调色(亮粉色 #ff0080,电光黄 #ffff00,或原始红 #ff0000),可选:孟菲斯风格的柔和色彩作为次要色",
"typography": "超粗体紧缩字体(Impact, Druk, 或 Bebas Neue 风格),大写标题,极端的尺寸对比,刻意紧凑或重叠的字母间距",
"imagery": "原始未过滤的摄影,刻意的视觉噪点,半色调图案,剪贴拼贴美学,手绘元素,贴纸和印章",
"layout": "破碎的网格,重叠的元素,粗黑色边框 (4-8px),可见的结构,反留白(密集但有组织的混乱)",
"effects": "硬阴影(无模糊,偏移 8-12px),像素化点缀,扫描线,CRT 屏幕效果,刻意的'错误'",
"visual_language": "反企业叛逆,DIY 杂志美学与数字结合,原始的真实性,通过大胆令人难忘"
}
}
{
"style": "3d-isometric",
"style_guidelines": {
"color_palette": "柔和的当代调色板:柔和的紫色 (#8b5cf6),蓝绿色 (#14b8a6),暖珊瑚色 (#fb7185),搭配奶油色或浅灰色背景 (#fafafa),元素间饱和度一致",
"typography": "友好的几何无衬线字体(Circular, Gilroy, 或 Quicksand 风格),中等字重标题,出色的可读性,舒适的 24pt 正文文本",
"imagery": "简洁的等距 3D 插图,一致的 30° 等距角度,柔和的粘土渲染美学,漂浮的平台和设备,可爱的简化对象",
"layout": "中央等距场景作为主视觉,文本围绕 3D 元素平衡,清晰的视觉层次,舒适的边距 (64px+)",
"effects": "柔和的投影 (20px 模糊,30% 不透明度),3D 对象上的环境光遮蔽,表面上的微妙渐变,一致的光源(左上角)",
"visual_language": "友好的科技插图(Slack, Notion, Asana 风格),平易近人的复杂性,通过简化实现清晰度"
}
}
{
"style": "editorial",
"style_guidelines": {
"color_palette": "精致的中性色:灰白色 (#f5f5f0),木炭色 (#2d2d2d),搭配单一强调色(勃艮第红 #7c2d12,森林绿 #14532d,或海军蓝 #1e3a5f),偶尔使用全彩摄影",
"typography": "精致的衬线字体用于标题(Playfair Display, Freight, 或 Editorial New 风格),干净的 sans-serif 用于正文(Söhne, Graphik),戏剧性的尺寸层次 (96pt 标题,16pt 正文),充足的行高 1.6",
"imagery": "杂志级摄影,戏剧性的裁剪,全出血图像,带有刻意负空间的人像,社论式灯光(Vogue, Bloomberg Businessweek 风格)",
"layout": "精致的网格系统(12 列),刻意的不对称,作为设计元素的引文,文本环绕图像,优雅的边距",
"effects": "最少的效果 - 让摄影和字体排印发光,微妙的图像处理(轻微去饱和度,胶片颗粒),优雅的边框和线条",
"visual_language": "高端杂志美学,知性的精致感,通过设计克制提升内容"
}
}
{
"style": "minimal-swiss",
"style_guidelines": {
"color_palette": "纯白色 (#ffffff) 或灰白色 (#fafaf9) 背景,纯黑色 (#000000) 文本,单一粗体强调色(瑞士红 #ff0000,克莱因蓝 #002fa7,或信号黄 #ffcc00)",
"typography": "Helvetica Neue 或 Aktiv Grotesk,严格的字体比例尺 (12/16/24/48/96),正文中等字重,仅使用粗体进行强调,左对齐右不齐",
"imagery": "客观的摄影,几何形状,简洁的图标,数学精度,刻意留空作为构图元素",
"layout": "严格遵守网格(基线网格在精神上可见),模块化构图,充足的留白(幻灯片 40%+ 以上),内容对齐到不可见的网格线",
"effects": "无 - 形式的纯粹性,无阴影,无渐变,无装饰元素,偶尔使用单一线条",
"visual_language": "国际字体排印风格,形式追随功能,永恒的现代主义,Dieter Rams 启发的克制"
}
}
{
"style": "keynote",
"style_guidelines": {
"color_palette": "深黑色 (#000000 到 #1d1d1f),纯白色文本,标志性蓝色 (#0071e3) 或渐变强调色(创意用紫-粉,科技用蓝-青)",
"typography": "San Francisco Pro Display,极端的字重对比(粗体 80pt+ 标题,细体 24pt 正文),标题负字母间距 (-0.03em),光学对齐",
"imagery": "电影感摄影,浅景深,戏剧性灯光(边缘光,聚光灯),带有倒影的产品主视觉拍摄,全出血图像",
"layout": "最大化的负空间,每张幻灯片单一强大的图像或陈述,内容居中或戏剧性地偏移,无杂乱",
"effects": "微妙的渐变叠加,关键元素上的光晕和发光效果,表面倒影,平滑的渐变背景",
"visual_language": "Apple WWDC 主题演讲美学,通过简洁体现自信,每个像素都经过考量,戏剧性的演示"
}
}
生成后:
/mnt/user-data/outputs/ 中present_files 工具与用户共享生成的演示文稿专业结果的提示工程:
视觉一致性(最重要):
consistency_note 字段现代美学的设计原则:
需要避免的常见错误:
不同场景的推荐风格:
glassmorphism 或 gradient-moderndark-premium 或 editorialgradient-modern 或 minimal-swissdark-premium 或 keynoteneo-brutalist 或 gradient-modernminimal-swiss 或 3d-isometric每周安装量
310
代码仓库
GitHub 星标数
29.5K
首次出现
2026年2月17日
安全审计
安装于
opencode304
codex301
gemini-cli301
github-copilot301
kimi-cli300
amp300
This skill generates professional PowerPoint presentations by creating AI-generated images for each slide and composing them into a PPTX file. The workflow includes planning the presentation structure with a consistent visual style, generating slide images sequentially (using the previous slide as a reference for style consistency), and assembling them into a final presentation.
Choose one of the following styles when creating the presentation plan:
| Style | Description | Best For |
|---|---|---|
| glassmorphism | Frosted glass panels with blur effects, floating translucent cards, vibrant gradient backgrounds, depth through layering | Tech products, AI/SaaS demos, futuristic pitches |
| dark-premium | Rich black backgrounds (#0a0a0a), luminous accent colors, subtle glow effects, luxury brand aesthetic | Premium products, executive presentations, high-end brands |
| gradient-modern | Bold mesh gradients, fluid color transitions, contemporary typography, vibrant yet sophisticated | Startups, creative agencies, brand launches |
| neo-brutalist | Raw bold typography, high contrast, intentional "ugly" aesthetic, anti-design as design, Memphis-inspired | Edgy brands, Gen-Z targeting, disruptive startups |
| 3d-isometric | Clean isometric illustrations, floating 3D elements, soft shadows, tech-forward aesthetic | Tech explainers, product features, SaaS presentations |
| editorial | Magazine-quality layouts, sophisticated typography hierarchy, dramatic photography, Vogue/Bloomberg aesthetic | Annual reports, luxury brands, thought leadership |
| minimal-swiss | Grid-based precision, Helvetica-inspired typography, bold use of negative space, timeless modernism | Architecture, design firms, premium consulting |
| keynote | Apple-inspired aesthetic with bold typography, dramatic imagery, high contrast, cinematic feel | Keynotes, product reveals, inspirational talks |
When a user requests presentation generation, identify:
/mnt/user-dataCreate a JSON file in /mnt/user-data/workspace/ with the presentation structure. Important : Include the style field to define the overall visual consistency.
{
"title": "Presentation Title",
"style": "keynote",
"style_guidelines": {
"color_palette": "Deep black backgrounds, white text, single accent color (blue or orange)",
"typography": "Bold sans-serif headlines, clean body text, dramatic size contrast",
"imagery": "High-quality photography, full-bleed images, cinematic composition",
"layout": "Generous whitespace, centered focus, minimal elements per slide"
},
"aspect_ratio": "16:9",
"slides": [
{
"slide_number": 1,
"type": "title",
"title": "Main Title",
"subtitle": "Subtitle or tagline",
"visual_description": "Detailed description for image generation"
},
{
"slide_number": 2,
"type": "content",
"title": "Slide Title",
"key_points": ["Point 1", "Point 2", "Point 3"],
"visual_description": "Detailed description for image generation"
}
]
}
IMPORTANT : Generate slides strictly one by one, in order. Do NOT parallelize or batch image generation. Each slide depends on the previous slide's output as a reference image. Generating slides in parallel will break visual consistency and is not allowed.
Read the image-generation skill: /mnt/skills/public/image-generation/SKILL.md
For the FIRST slide (slide 1) , create a prompt that establishes the visual style:
{
"prompt": "Professional presentation slide. [style_guidelines from plan]. Title: 'Your Title'. [visual_description]. This slide establishes the visual language for the entire presentation.",
"style": "[Based on chosen style - e.g., Apple Keynote aesthetic, dramatic lighting, cinematic]",
"composition": "Clean layout with clear text hierarchy, [style-specific composition]",
"color_palette": "[From style_guidelines]",
"typography": "[From style_guidelines]"
}
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/slide-01-prompt.json \
--output-file /mnt/user-data/outputs/slide-01.jpg \
--aspect-ratio 16:9
3. For subsequent slides (slide 2+) , use the PREVIOUS slide as a reference image:
{
"prompt": "Professional presentation slide continuing the visual style from the reference image. Maintain the same color palette, typography style, and overall aesthetic. Title: 'Slide Title'. [visual_description]. Keep visual consistency with the reference.",
"style": "Match the style of the reference image exactly",
"composition": "Similar layout principles as reference, adapted for this content",
"color_palette": "Same as reference image",
"consistency_note": "This slide must look like it belongs in the same presentation as the reference image"
}
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/slide-02-prompt.json \
--reference-images /mnt/user-data/outputs/slide-01.jpg \
--output-file /mnt/user-data/outputs/slide-02.jpg \
--aspect-ratio 16:9
4. Continue for all remaining slides , always referencing the previous slide:
# Slide 3 references slide 2
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/slide-03-prompt.json \
--reference-images /mnt/user-data/outputs/slide-02.jpg \
--output-file /mnt/user-data/outputs/slide-03.jpg \
--aspect-ratio 16:9
# Slide 4 references slide 3
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/slide-04-prompt.json \
--reference-images /mnt/user-data/outputs/slide-03.jpg \
--output-file /mnt/user-data/outputs/slide-04.jpg \
--aspect-ratio 16:9
After all slide images are generated, call the composition script:
python /mnt/skills/public/ppt-generation/scripts/generate.py \
--plan-file /mnt/user-data/workspace/presentation-plan.json \
--slide-images /mnt/user-data/outputs/slide-01.jpg /mnt/user-data/outputs/slide-02.jpg /mnt/user-data/outputs/slide-03.jpg \
--output-file /mnt/user-data/outputs/presentation.pptx
Parameters:
--plan-file: Absolute path to the presentation plan JSON file (required)--slide-images: Absolute paths to slide images in order (required, space-separated)--output-file: Absolute path to output PPTX file (required)[!NOTE] Do NOT read the python file, just call it with the parameters.
User request: "Create a presentation about AI product launch"
Create /mnt/user-data/workspace/ai-product-plan.json:
{
"title": "Introducing Nova AI",
"style": "glassmorphism",
"style_guidelines": {
"color_palette": "Vibrant purple-to-cyan gradient background (#667eea→#00d4ff), frosted glass panels with 15-20% white opacity, electric accents",
"typography": "SF Pro Display style, bold 700 weight white titles with subtle text-shadow, clean 400 weight body text, excellent contrast on glass",
"imagery": "Abstract 3D glass spheres, floating translucent geometric shapes, soft luminous orbs, depth through layered transparency",
"layout": "Centered frosted glass cards with 32px rounded corners, 48-64px padding, floating above gradient, layered depth with soft shadows",
"effects": "Backdrop blur 20-40px on glass panels, subtle white border glow, soft colored shadows matching gradient, light refraction effects",
"visual_language": "Apple Vision Pro / visionOS aesthetic, premium depth through transparency, futuristic yet approachable, 2024 design trends"
},
"aspect_ratio": "16:9",
"slides": [
{
"slide_number": 1,
"type": "title",
"title": "Introducing Nova AI",
"subtitle": "Intelligence, Reimagined",
"visual_description": "Stunning gradient background flowing from deep purple (#667eea) through magenta to cyan (#00d4ff). Center: large frosted glass panel with strong backdrop blur, containing bold white title 'Introducing Nova AI' and lighter subtitle. Floating 3D glass spheres and abstract shapes around the card creating depth. Soft glow emanating from behind the glass panel. Premium visionOS aesthetic. The glass card has subtle white border (1px rgba 255,255,255,0.3) and soft purple-tinted shadow."
},
{
"slide_number": 2,
"type": "content",
"title": "Why Nova?",
"key_points": ["10x faster processing", "Human-like understanding", "Enterprise-grade security"],
"visual_description": "Same purple-cyan gradient background. Left side: floating frosted glass card with title 'Why Nova?' in bold white, three key points below with subtle glass pill badges. Right side: abstract 3D visualization of neural network as interconnected glass nodes with soft glow. Floating translucent geometric shapes (icosahedrons, tori) adding depth. Consistent glassmorphism aesthetic with previous slide."
},
{
"slide_number": 3,
"type": "content",
"title": "How It Works",
"key_points": ["Natural language input", "Multi-modal processing", "Instant insights"],
"visual_description": "Gradient background consistent with previous slides. Central composition: three stacked frosted glass cards at slight angles showing the workflow steps, connected by soft glowing lines. Each card has an abstract icon. Floating glass orbs and light particles around the composition. Title 'How It Works' in bold white at top. Depth created through card layering and transparency."
},
{
"slide_number": 4,
"type": "content",
"title": "Built for Scale",
"key_points": ["1M+ concurrent users", "99.99% uptime", "Global infrastructure"],
"visual_description": "Same gradient background. Asymmetric layout: right side features large frosted glass panel with metrics displayed in bold typography. Left side: abstract 3D globe made of glass panels and connection lines, representing global scale. Floating data visualization elements as small glass cards with numbers. Soft ambient glow throughout. Premium tech aesthetic."
},
{
"slide_number": 5,
"type": "conclusion",
"title": "The Future Starts Now",
"subtitle": "Join the waitlist",
"visual_description": "Dramatic finale slide. Gradient background with slightly increased vibrancy. Central frosted glass card with bold title 'The Future Starts Now' and call-to-action subtitle. Behind the card: burst of soft light rays and floating glass particles creating celebration effect. Multiple layered glass shapes creating depth. The most visually impactful slide while maintaining style consistency."
}
]
}
Read /mnt/skills/public/image-generation/SKILL.md to understand how to generate images.
Slide 1 - Title (establishes the visual language):
Create /mnt/user-data/workspace/nova-slide-01.json:
{
"prompt": "Ultra-premium presentation title slide with glassmorphism design. Background: smooth flowing gradient from deep purple (#667eea) through magenta (#f093fb) to cyan (#00d4ff), soft and vibrant. Center: large frosted glass panel with strong backdrop blur effect, rounded corners 32px, containing bold white sans-serif title 'Introducing Nova AI' (72pt, SF Pro Display style, font-weight 700) with subtle text shadow, subtitle 'Intelligence, Reimagined' below in lighter weight. The glass panel has subtle white border (1px rgba 255,255,255,0.25) and soft purple-tinted drop shadow. Floating around the card: 3D glass spheres with refraction, translucent geometric shapes (icosahedrons, abstract blobs), creating depth and dimension. Soft luminous glow emanating from behind the glass panel. Small floating particles of light. Apple Vision Pro / visionOS UI aesthetic. Professional presentation slide, 16:9 aspect ratio. Hyper-modern, premium tech product launch feel.",
"style": "Glassmorphism, visionOS aesthetic, Apple Vision Pro UI style, premium tech, 2024 design trends",
"composition": "Centered glass card as focal point, floating 3D elements creating depth at edges, 40% negative space, clear visual hierarchy",
"lighting": "Soft ambient glow from gradient, light refraction through glass elements, subtle rim lighting on 3D shapes",
"color_palette": "Purple gradient #667eea, magenta #f093fb, cyan #00d4ff, frosted white rgba(255,255,255,0.15), pure white text #ffffff",
"effects": "Backdrop blur on glass panels, soft drop shadows with color tint, light refraction, subtle noise texture on glass, floating particles"
}
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/nova-slide-01.json \
--output-file /mnt/user-data/outputs/nova-slide-01.jpg \
--aspect-ratio 16:9
Slide 2 - Content (MUST reference slide 1 for consistency):
Create /mnt/user-data/workspace/nova-slide-02.json:
{
"prompt": "Presentation slide continuing EXACT visual style from reference image. SAME purple-to-cyan gradient background, SAME glassmorphism aesthetic, SAME typography style. Left side: frosted glass card with backdrop blur containing title 'Why Nova?' in bold white (matching reference font style), three feature points as subtle glass pill badges below. Right side: abstract 3D neural network visualization made of interconnected glass nodes with soft cyan glow, floating in space. Floating translucent geometric shapes (matching style from reference) adding depth. The frosted glass has identical treatment: white border, purple-tinted shadow, same blur intensity. CRITICAL: This slide must look like it belongs in the exact same presentation as the reference image - same colors, same glass treatment, same overall aesthetic.",
"style": "MATCH REFERENCE EXACTLY - Glassmorphism, visionOS aesthetic, same visual language",
"composition": "Asymmetric split: glass card left (40%), 3D visualization right (40%), breathing room between elements",
"color_palette": "EXACTLY match reference: purple #667eea, cyan #00d4ff gradient, same frosted white treatment, same text white",
"consistency_note": "CRITICAL: Must be visually identical in style to reference image. Same gradient colors, same glass blur intensity, same shadow treatment, same typography weight and style. Viewer should immediately recognize this as the same presentation."
}
python /mnt/skills/public/image-generation/scripts/generate.py \
--prompt-file /mnt/user-data/workspace/nova-slide-02.json \
--reference-images /mnt/user-data/outputs/nova-slide-01.jpg \
--output-file /mnt/user-data/outputs/nova-slide-02.jpg \
--aspect-ratio 16:9
Slides 3-5: Continue the same pattern, each referencing the previous slide
Key consistency rules for subsequent slides:
consistency_note emphasizing style matchingpython /mnt/skills/public/ppt-generation/scripts/generate.py \
--plan-file /mnt/user-data/workspace/nova-plan.json \
--slide-images /mnt/user-data/outputs/nova-slide-01.jpg /mnt/user-data/outputs/nova-slide-02.jpg /mnt/user-data/outputs/nova-slide-03.jpg /mnt/user-data/outputs/nova-slide-04.jpg /mnt/user-data/outputs/nova-slide-05.jpg \
--output-file /mnt/user-data/outputs/nova-presentation.pptx
{
"style": "glassmorphism",
"style_guidelines": {
"color_palette": "Vibrant gradient backgrounds (purple #667eea to pink #f093fb, or cyan #4facfe to blue #00f2fe), frosted white panels with 20% opacity, accent colors that pop against the gradient",
"typography": "SF Pro Display or Inter font style, bold 600-700 weight titles, clean 400 weight body, white text with subtle drop shadow for readability on glass",
"imagery": "Abstract 3D shapes floating in space, soft blurred orbs, geometric primitives with glass material, depth through overlapping translucent layers",
"layout": "Floating card panels with backdrop-blur effect, generous padding (48-64px), rounded corners (24-32px radius), layered depth with subtle shadows",
"effects": "Frosted glass blur (backdrop-filter: blur 20px), subtle white border (1px rgba 255,255,255,0.2), soft glow behind panels, floating elements with drop shadows",
"visual_language": "Premium tech aesthetic like Apple Vision Pro UI, depth through transparency, light refracting through glass surfaces"
}
}
{
"style": "dark-premium",
"style_guidelines": {
"color_palette": "Deep black base (#0a0a0a to #121212), luminous accent color (electric blue #00d4ff, neon purple #bf5af2, or gold #ffd700), subtle gray gradients for depth (#1a1a1a to #0a0a0a)",
"typography": "Elegant sans-serif (Neue Haas Grotesk or Suisse Int'l style), dramatic size contrast (72pt+ headlines, 18pt body), letter-spacing -0.02em for headlines, pure white (#ffffff) text",
"imagery": "Dramatic studio lighting, rim lights and edge glow, cinematic product shots, abstract light trails, premium material textures (brushed metal, matte surfaces)",
"layout": "Generous negative space (60%+), asymmetric balance, content anchored to grid but with breathing room, single focal point per slide",
"effects": "Subtle ambient glow behind key elements, light bloom effects, grain texture overlay (2-3% opacity), vignette on edges",
"visual_language": "Luxury tech brand aesthetic (Bang & Olufsen, Porsche Design), sophistication through restraint, every element intentional"
}
}
{
"style": "gradient-modern",
"style_guidelines": {
"color_palette": "Bold mesh gradients (Stripe/Linear style: purple-pink-orange #7c3aed→#ec4899→#f97316, or cool tones: cyan-blue-purple #06b6d4→#3b82f6→#8b5cf6), white or dark text depending on background intensity",
"typography": "Modern geometric sans-serif (Satoshi, General Sans, or Clash Display style), variable font weights, oversized bold headlines (80pt+), comfortable body text (20pt)",
"imagery": "Abstract fluid shapes, morphing gradients, 3D rendered abstract objects, soft organic forms, floating geometric primitives",
"layout": "Dynamic asymmetric compositions, overlapping elements with blend modes, text integrated with gradient flows, full-bleed backgrounds",
"effects": "Smooth gradient transitions, subtle noise texture (3-5% for depth), soft shadows with color tint matching gradient, motion blur suggesting movement",
"visual_language": "Contemporary SaaS aesthetic (Stripe, Linear, Vercel), energetic yet professional, forward-thinking tech vibes"
}
}
{
"style": "neo-brutalist",
"style_guidelines": {
"color_palette": "High contrast primaries: stark black, pure white, with bold accent (hot pink #ff0080, electric yellow #ffff00, or raw red #ff0000), optional: Memphis-inspired pastels as secondary",
"typography": "Ultra-bold condensed type (Impact, Druk, or Bebas Neue style), UPPERCASE headlines, extreme size contrast, intentionally tight or overlapping letter-spacing",
"imagery": "Raw unfiltered photography, intentional visual noise, halftone patterns, cut-out collage aesthetic, hand-drawn elements, stickers and stamps",
"layout": "Broken grid, overlapping elements, thick black borders (4-8px), visible structure, anti-whitespace (dense but organized chaos)",
"effects": "Hard shadows (no blur, offset 8-12px), pixelation accents, scan lines, CRT screen effects, intentional 'mistakes'",
"visual_language": "Anti-corporate rebellion, DIY zine aesthetic meets digital, raw authenticity, memorable through boldness"
}
}
{
"style": "3d-isometric",
"style_guidelines": {
"color_palette": "Soft contemporary palette: muted purples (#8b5cf6), teals (#14b8a6), warm corals (#fb7185), with cream or light gray backgrounds (#fafafa), consistent saturation across elements",
"typography": "Friendly geometric sans-serif (Circular, Gilroy, or Quicksand style), medium weight headlines, excellent readability, comfortable 24pt body text",
"imagery": "Clean isometric 3D illustrations, consistent 30° isometric angle, soft clay-render aesthetic, floating platforms and devices, cute simplified objects",
"layout": "Central isometric scene as hero, text balanced around 3D elements, clear visual hierarchy, comfortable margins (64px+)",
"effects": "Soft drop shadows (20px blur, 30% opacity), ambient occlusion on 3D objects, subtle gradients on surfaces, consistent light source (top-left)",
"visual_language": "Friendly tech illustration (Slack, Notion, Asana style), approachable complexity, clarity through simplification"
}
}
{
"style": "editorial",
"style_guidelines": {
"color_palette": "Sophisticated neutrals: off-white (#f5f5f0), charcoal (#2d2d2d), with single accent color (burgundy #7c2d12, forest #14532d, or navy #1e3a5f), occasional full-color photography",
"typography": "Refined serif for headlines (Playfair Display, Freight, or Editorial New style), clean sans-serif for body (Söhne, Graphik), dramatic size hierarchy (96pt headlines, 16pt body), generous line-height 1.6",
"imagery": "Magazine-quality photography, dramatic crops, full-bleed images, portraits with intentional negative space, editorial lighting (Vogue, Bloomberg Businessweek style)",
"layout": "Sophisticated grid system (12-column), intentional asymmetry, pull quotes as design elements, text wrapping around images, elegant margins",
"effects": "Minimal effects - let photography and typography shine, subtle image treatments (slight desaturation, film grain), elegant borders and rules",
"visual_language": "High-end magazine aesthetic, intellectual sophistication, content elevated through design restraint"
}
}
{
"style": "minimal-swiss",
"style_guidelines": {
"color_palette": "Pure white (#ffffff) or off-white (#fafaf9) backgrounds, true black (#000000) text, single bold accent (Swiss red #ff0000, Klein blue #002fa7, or signal yellow #ffcc00)",
"typography": "Helvetica Neue or Aktiv Grotesk, strict type scale (12/16/24/48/96), medium weight for body, bold for emphasis only, flush-left ragged-right alignment",
"imagery": "Objective photography, geometric shapes, clean iconography, mathematical precision, intentional empty space as compositional element",
"layout": "Strict grid adherence (baseline grid visible in spirit), modular compositions, generous whitespace (40%+ of slide), content aligned to invisible grid lines",
"effects": "None - purity of form, no shadows, no gradients, no decorative elements, occasional single hairline rules",
"visual_language": "International Typographic Style, form follows function, timeless modernism, Dieter Rams-inspired restraint"
}
}
{
"style": "keynote",
"style_guidelines": {
"color_palette": "Deep blacks (#000000 to #1d1d1f), pure white text, signature blue (#0071e3) or gradient accents (purple-pink for creative, blue-teal for tech)",
"typography": "San Francisco Pro Display, extreme weight contrast (bold 80pt+ titles, light 24pt body), negative letter-spacing on headlines (-0.03em), optical alignment",
"imagery": "Cinematic photography, shallow depth of field, dramatic lighting (rim lights, spot lighting), product hero shots with reflections, full-bleed imagery",
"layout": "Maximum negative space, single powerful image or statement per slide, content centered or dramatically offset, no clutter",
"effects": "Subtle gradient overlays, light bloom and glow on key elements, reflection on surfaces, smooth gradient backgrounds",
"visual_language": "Apple WWDC keynote aesthetic, confidence through simplicity, every pixel considered, theatrical presentation"
}
}
After generation:
/mnt/user-data/outputs/present_files toolPrompt Engineering for Professional Results:
Visual Consistency (Most Important):
consistency_note field in every JSON prompt after slide 1Design Principles for Modern Aesthetics:
Common Mistakes to Avoid:
Recommended Styles for Different Contexts:
glassmorphism or gradient-moderndark-premium or editorialgradient-modern or minimal-swissdark-premium or keynoteneo-brutalist or gradient-modernWeekly Installs
310
Repository
GitHub Stars
29.5K
First Seen
Feb 17, 2026
Security Audits
Gen Agent Trust HubPassSocketPassSnykPass
Installed on
opencode304
codex301
gemini-cli301
github-copilot301
kimi-cli300
amp300
AI Elements:基于shadcn/ui的AI原生应用组件库,快速构建对话界面
56,200 周安装
minimal-swiss3d-isometric