Nano Banana 2 AI图像生成大师 - 结构化JSON参数生成超真实图像

Nano Banana 2 Image Generation Master by aiagentwithdhruv/skills

4 GitHub Stars

GitHub

安装命令

npx skills add https://github.com/aiagentwithdhruv/skills --skill 'Nano Banana 2 Image Generation Master'

AI/机器学习内容创作自动化

🇨🇳中文介绍

Nano Banana 2 图像生成大师

目标

本技能的用途是提供一个标准化、高度可控的方法，使用 AI 模型 Nano Banana 2（或任何连接到 generate_image 工具的底层模型）来生成图像。通过严格执行结构化的 JSON 参数模式，此技能可以中和原生模型的偏见（如过度平滑、数据集平均化或"塑料"AI 风格），并确保输出原始、未经修饰、超真实的图像。

前提条件

fal.ai API 密钥（在 .env 文件中设置为 FAL_KEY）— 在 https://fal.ai 注册（免费套餐，Nano Banana 2 模型）
或 Euri API 密钥（在 .env 文件中设置为 EURI_API_KEY）— Euron 学生在 https://euron.one/euri 可免费获取
对用户期望的主体、光照和相机特性有清晰的理解。

核心模式结构

为 generate_image 工具构建提示时，你必须使用以下 JSON 模式作为基础。用极其微观的细节填充字符串值。

广告位招租

在这里展示您的产品或服务

触达数万 AI 开发者，精准高效

联系我们

相关 Skills

FlyClaw：零登录航班聚合查询工具，Python实现多源航班信息与价格搜索

4,000,000 周安装

find-skills 技能搜索工具 - Vercel Labs 开源智能体技能包管理器

794,900 周安装

Azure RBAC 权限管理工具：查找最小角色、创建自定义角色与自动化分配

110,700 周安装

React 组合模式指南：Vercel 组件架构最佳实践，提升代码可维护性

107,800 周安装

{
  "task": "string - 高级目标（例如，'sports_selfie_collage'、'single_macro_portrait'）",
  
  "output": {
    "type": "string - 例如，'single_image'、'4-panel_collage'",
    "layout": "string - 例如，'1x1'、'2x2_grid'、'side-by-side'",
    "aspect_ratio": "string - 例如，'3:4'、'16:9'、'4:5'",
    "resolution": "string - 例如，'ultra_high'、'medium_low'",
    "camera_style": "string - 例如，'smartphone_front_camera'、'professional_dslr'"
  },

  "image_quality_simulation": {
    "sharpness": "string - 例如，'tack_sharp'、'slightly_soft_edges'",
    "noise": "string - 例如，'unfiltered_sensor_grain'、'visible_film_grain'、'clean_digital'",
    "compression_artifacts": "boolean - 如果尝试模拟上传的用户生成内容，则为 true",
    "dynamic_range": "string - 例如，'limited'、'hdr_capable'",
    "white_balance": "string - 例如，'slightly_warm'、'cool_fluorescent'",
    "lens_imperfections": [
      "array of strings - 例如，'subtle chromatic aberration'、'minor lens distortion'、'vignetting'"
    ]
  },

  "subject": {
    "type": "string - 例如，'human_portrait'、'nature_macro'、'infographic_flatlay'",
    "human_details": {
      "//": "仅对人类主体使用此块",
      "identity": "string",
      "appearance": "string - 极其具体（例如，visible pores, mild redness）",
      "outfit": "string"
    },
    "object_or_nature_details": {
      "//": "对非人类主体使用此块",
      "material_or_texture": "string - 例如，'brushed aluminum'、'dew-covered velvety petals'",
      "wear_and_tear": "string - 例如，'subtle scratches on the anodized finish'、'browning edges on leaves'",
      "typography": "string - 例如，'clean sans-serif overlaid text, perfectly legible'"
    }
  },

  "multi_panel_layout": {
    "grid_panels": [
      {
        "panel": "string - 例如，'top_left'、'full_frame'（如果不是网格）",
        "pose": "string - 例如，'slight upward selfie angle, relaxed smile'",
        "action": "string - 例如，'holding phone with one hand, casual posture'"
      }
    ]
  },

  "environment": {
    "location": "string - 例如，'gym or outdoor sports area'",
    "background": "string - 主体背后是什么（例如，'blurred gym equipment'）",
    "lighting": {
      "type": "string - 例如，'natural or overhead gym lighting'、'harsh direct sunlight'",
      "quality": "string - 例如，'uneven, realistic, non-studio'、'high-contrast dramatic'"
    }
  },

  "embedded_text_and_overlays": {
    "text": "string (optional)",
    "location": "string (optional)"
  },

  "structural_preservation": {
    "preservation_rules": [
      "array of strings - 例如，'Exact physical proportions must be preserved'"
    ]
  },

  "controlnet": {
    "pose_control": {
      "model_type": "string - 例如，'DWPose'",
      "purpose": "string",
      "constraints": ["array of strings"],
      "recommended_weight": "number"
    },
    "depth_control": {
      "model_type": "string - 例如，'ZoeDepth'",
      "purpose": "string",
      "constraints": ["array of strings"],
      "recommended_weight": "number"
    }
  },

  "explicit_restrictions": {
    "no_professional_retouching": "boolean - 通常为 true 以保证真实感",
    "no_studio_lighting": "boolean - 通常为 true 以模拟抓拍效果",
    "no_ai_beauty_filters": "boolean - 必须为 true 以避免塑料感",
    "no_high_end_camera_look": "boolean - 如果模拟智能手机拍摄，则为 true"
  },

  "negative_prompt": {
    "forbidden_elements": [
      "array of strings - 实现极致真实感所需的大量'AI风格'阻断词列表。示例堆栈：'anatomy normalization'、'body proportion averaging'、'dataset-average anatomy'、'wide-angle distortion not in reference'、'lens compression not in reference'、'cropping that removes volume'、'depth flattening'、'mirror selfies'、'reflections'、'beautification filters'、'skin smoothing'、'plastic skin'、'airbrushed texture'、'stylized realism'、'editorial fashion proportions'、'more realistic reinterpretation'"
    ]
  }
}

相机参数： 始终定义精确的焦距、光圈和 ISO（例如，85mm lens, f/2.0, ISO 200）。这迫使模型模仿光学物理特性，而不是数字渲染。
明确瑕疵： 像"真实感"这样的词是不够的。要指定瑕疵：mild redness、subtle freckles、light acne marks、unguided grooming。
直接命令： 在正向提示段落中内部使用祈使否定命令：Do not beautify or alter facial features. No makeup styling.
光照行为： 不要只命名光源，要描述它的效果：direct flash photography, creating sharp highlights on skin and a slightly shadowed background.
非人类材质（产品/自然）： 当生成非人类对象时，用极端的材质物理特性替换皮肤/服装逻辑。定义表面划痕（例如，"micro-scratches on anodized aluminum"）、光线散射（例如，"subsurface scattering through dew-covered petals"）或图形布局（例如，"flat-lay composition, clean sans-serif typography"）。
强制否定堆栈： 你必须包含广泛的否定提示块（例如，禁止"skin smoothing"和"anatomy normalization"）。
避免过度降质（噪声陷阱）： 虽然模拟相机缺陷（如 compression artifacts）有助于真实感，但在复杂、高对比度的环境（如霓虹灯夜晚街道）中过度使用 ISO 3200 或 heavy film grain 实际上会触发模型的"数字艺术/插图"偏见。将 ISO 设置保持在 800 以下，并依靠物理主体的瑕疵（如桃色绒毛或不对称的毛孔）来体现真实感，而不是依赖大量的相机噪点。

🇺🇸English

Nano Banana 2 Image Generation Master

Goal

The purpose of this skill is to provide a standardized, highly controlled method for generating images using AI model Nano Banana 2 (or any underlying model connected to the generate_image tool). By strictly enforcing a structured JSON parameter schema, this skill neutralizes native model biases (like over-smoothing, dataset-averaging, or "plastic" AI styling) and ensures raw, unretouched, hyper-realistic outputs.

Prerequisites

fal.ai API key (FAL_KEY in .env) — sign up at https://fal.ai (free tier, Nano Banana 2 model)
OR Euri API key (EURI_API_KEY in .env) — free for Euron students at https://euron.one/euri
A clear understanding of the user's desired Subject, Lighting, and Camera characteristics.

Core Schema Structure

When constructing a prompt for the generate_image tool, you MUST use the following JSON schema as the foundation. Fill in the string values with extreme, microscopic detail.

{
  "task": "string - High-level goal (e.g., 'sports_selfie_collage', 'single_macro_portrait')",
  
  "output": {
    "type": "string - e.g., 'single_image', '4-panel_collage'",
    "layout": "string - e.g., '1x1', '2x2_grid', 'side-by-side'",
    "aspect_ratio": "string - e.g., '3:4', '16:9', '4:5'",
    "resolution": "string - e.g., 'ultra_high', 'medium_low'",
    "camera_style": "string - e.g., 'smartphone_front_camera', 'professional_dslr'"
  },

  "image_quality_simulation": {
    "sharpness": "string - e.g., 'tack_sharp', 'slightly_soft_edges'",
    "noise": "string - e.g., 'unfiltered_sensor_grain', 'visible_film_grain', 'clean_digital'",
    "compression_artifacts": "boolean - true if attempting to simulate uploaded UGC",
    "dynamic_range": "string - e.g., 'limited', 'hdr_capable'",
    "white_balance": "string - e.g., 'slightly_warm', 'cool_fluorescent'",
    "lens_imperfections": [
      "array of strings - e.g., 'subtle chromatic aberration', 'minor lens distortion', 'vignetting'"
    ]
  },

  "subject": {
    "type": "string - e.g., 'human_portrait', 'nature_macro', 'infographic_flatlay'",
    "human_details": {
      "//": "Use this block ONLY for human subjects",
      "identity": "string",
      "appearance": "string - Extremely specific (e.g., visible pores, mild redness)",
      "outfit": "string"
    },
    "object_or_nature_details": {
      "//": "Use this block for non-human subjects",
      "material_or_texture": "string - e.g., 'brushed aluminum', 'dew-covered velvety petals'",
      "wear_and_tear": "string - e.g., 'subtle scratches on the anodized finish', 'browning edges on leaves'",
      "typography": "string - e.g., 'clean sans-serif overlaid text, perfectly legible'"
    }
  },

  "multi_panel_layout": {
    "grid_panels": [
      {
        "panel": "string - e.g., 'top_left', 'full_frame' (if not a grid)",
        "pose": "string - e.g., 'slight upward selfie angle, relaxed smile'",
        "action": "string - e.g., 'holding phone with one hand, casual posture'"
      }
    ]
  },

  "environment": {
    "location": "string - e.g., 'gym or outdoor sports area'",
    "background": "string - What is behind the subject (e.g., 'blurred gym equipment')",
    "lighting": {
      "type": "string - e.g., 'natural or overhead gym lighting', 'harsh direct sunlight'",
      "quality": "string - e.g., 'uneven, realistic, non-studio', 'high-contrast dramatic'"
    }
  },

  "embedded_text_and_overlays": {
    "text": "string (optional)",
    "location": "string (optional)"
  },

  "structural_preservation": {
    "preservation_rules": [
      "array of strings - e.g., 'Exact physical proportions must be preserved'"
    ]
  },

  "controlnet": {
    "pose_control": {
      "model_type": "string - e.g., 'DWPose'",
      "purpose": "string",
      "constraints": ["array of strings"],
      "recommended_weight": "number"
    },
    "depth_control": {
      "model_type": "string - e.g., 'ZoeDepth'",
      "purpose": "string",
      "constraints": ["array of strings"],
      "recommended_weight": "number"
    }
  },

  "explicit_restrictions": {
    "no_professional_retouching": "boolean - typically true for realism",
    "no_studio_lighting": "boolean - typically true for candid shots",
    "no_ai_beauty_filters": "boolean - mandatory true to avoid plastic look",
    "no_high_end_camera_look": "boolean - true if simulating smartphones"
  },

  "negative_prompt": {
    "forbidden_elements": [
      "array of strings - Massive list of 'AI style' blockers required for extreme realism. Example stack: 'anatomy normalization', 'body proportion averaging', 'dataset-average anatomy', 'wide-angle distortion not in reference', 'lens compression not in reference', 'cropping that removes volume', 'depth flattening', 'mirror selfies', 'reflections', 'beautification filters', 'skin smoothing', 'plastic skin', 'airbrushed texture', 'stylized realism', 'editorial fashion proportions', 'more realistic reinterpretation'"
    ]
  }
}

Paradigm 2: The Dense Narrative Format (Optimized for APIs like fal.ai)

When executing API calls to standard generation endpoints (which often only accept string prompts), it is incredibly powerful to condense the logic above into a dense, flat JSON string containing a massive descriptive text block.

{
  "prompt": "string - A dense, ultra-descriptive narrative. Use specific camera math (85mm lens, f/1.8, ISO 200), explicit flaws (visible pores, mild redness, subtle freckles, light acne marks), lighting behavior (direct on-camera flash creating sharp highlights), and direct negative commands (Do not beautify or alter facial features).",
  "negative_prompt": "string - A comma-separated list of explicit realism blockers (no plastic skin, no CGI).",
  "image_input": [
    "array of strings (URLs) - Optional. Input images to transform or use as reference (up to 14). Formatting: URL to jpeg, png, or webp. Max size: 30MB."
  ],
  "api_parameters": {
    "google_search": "boolean - Optional. Use Google Web Search grounding",
    "resolution": "string - Optional. '1K', '2K', or '4K' (default 1K)",
    "output_format": "string - Optional. 'jpg' or 'png' (default jpg)",
    "aspect_ratio": "string - Optional. Overrides CLI aspect_ratio (e.g., '16:9', '4:5', 'auto')"
  },
  "settings": {
    "resolution": "string",
    "style": "string - e.g., 'documentary realism'",
    "lighting": "string - e.g., 'direct on-camera flash'",
    "camera_angle": "string",
    "depth_of_field": "string - e.g., 'shallow depth of field'",
    "quality": "string - e.g., 'high detail, unretouched skin'"
  }
}

Best Practices & Natural Language Hacks

Camera Mathematics: Always define exact focal length, aperture, and ISO (e.g., 85mm lens, f/2.0, ISO 200). This forces the model to mimic optical physics rather than digital rendering.
Explicit Imperfections: Words like "realistic" are not enough. Dictate flaws: mild redness, subtle freckles, light acne marks, unguided grooming.
Direct Commands: Use imperative negative commands inside the positive prompt paragraph: Do not beautify or alter facial features. No makeup styling.
Lighting Behavior: Don't just name the light, name what it does: direct flash photography, creating sharp highlights on skin and a slightly shadowed background.
Non-Human Materials (Products/Nature): When generating non-humans, replace skin/outfit logic with extreme material physics. Define surface scoring (e.g., "micro-scratches on anodized aluminum"), light scattering (e.g., "subsurface scattering through dew-covered petals"), or graphic layouts (e.g., "flat-lay composition, clean sans-serif typography").

Master Reference Guide

If you require the absolute full schema breakdown, parameter options, or the complex JSON structing for multi-panel grids, refer to: master_prompt_reference.md (in this skill's folder)

Execution via fal.ai (Primary — Replaces kie.ai)

Use fal.ai's Nano Banana 2 model for hyper-realistic image generation.

Prerequisites:

Your .env file must contain FAL_KEY="your_key" (get at https://fal.ai/dashboard/keys)
A JSON prompt file matching the Dense Narrative Format saved in /prompts/

Execution:

# Using the Videos toolkit script (recommended) — defaults to nano-banana-2
python Social-Media-Agent-1.0/Videos/scripts/generate_fal.py "<dense_prompt>" output.jpg --size portrait_4_3

# Or using the legacy kie.ai script (if you still have KIE_API_KEY)
python scripts/generate_kie.py prompts/your_prompt.json images/output_image.jpg "4:5"

Execution via Euri API (Alternative — Free for Students)

Use Euri's Gemini 3 Pro Image Preview model.

python Social-Media-Agent-1.0/Videos/scripts/generate_euri.py "<dense_prompt>" output.jpg

Env: EURI_API_KEY in .env | Free: 200K tokens/day

How to use this skill

When a user asks you to generate a highly detailed, realistic, or complex image, you must construct the prompt string formatted EXACTLY like the JSON schema above. Pass that entire string as the prompt argument to the generation script (fal.ai preferred, Euri as fallback).

Weekly Installs

Repository

aiagentwithdhruv/skills

GitHub Stars

First Seen

Jan 1, 1970

Security Audits

Gen Agent Trust HubFail SocketPass SnykWarn