Google Gemini 2.5 Nano banana生成东方女性

130 阅读6分钟

背景

Google Gemini 2.5 Nano Banana(官方名称为 Gemini 2.5 Flash Image)是谷歌于 2025 年 8 月推出的革命性 AI 图像生成与编辑模型,凭借其多模态架构、角色一致性和实时协作能力,重新定义了 AI 视觉内容创作的标准。采用统一的 Transformer 架构,原生支持文本与图像的无缝融合,无需中间转换步骤。例如,用户可直接通过自然语言指令(如 “将这张照片改为复古港风,背景换成 80 年代香港街头”)实现复杂编辑,模型能精准对齐语义与视觉元素,避免传统模型的信息丢失问题。内置 Gemini 家族的全球知识库,可理解物理原理(如光影反射、透视关系)和文化语境。例如,生成 “巴黎铁塔夜景中的产品图” 时,模型会自动匹配环境光色温与产品表面反光效果,确保物理真实性。

99% 面部特征保留突破性的 “全局外观 Token + 局部细节 Token” 双重约束机制,可在不同场景、光线和服装变化中保持人物、宠物甚至产品的外观一致性。

支持连续多轮修改,用户可逐步细化需求(如先模糊背景,再调整姿势,最后添加道具),模型全程保持上下文记忆,避免传统工具的 “断层式” 修改。
提供 Gemini API、Google AI Studio 等接入方式,支持 Python、JavaScript 等主流语言,开发者可快速集成到电商平台、教育工具或设计软件中。例如,电商网站可通过 API 自动生成多场景产品图。

Gemini_Generated_Image_2m25wk2m25wk2m25

提示词

{
"style": "High-key studio portrait, direct flash aesthetic, East Asian social media style (e.g., Ulzzang, Douyin), stylized beauty retouching.",
"output": {
"color_profile": "sRGB",
"render_intent": "photo"
},
"subject": {
"category": "human",
"gender_presentation": "female",
"ethnicity": "East Asian (e.g., Korean, Chinese)",
"age_bracket": "young_adult",
"body": {
"build": "slim",
"proportions": "natural human anatomy",
"posture": "relaxed on sofa, seated casually",
"pose": "seated, legs crossed and tucked close to body",
"gesture": "Right hand raised, fingers loosely curled, back of fingers/knuckles gently supporting the chin and lower cheek.",
"head_tilt_deg": 5
},
"face": {
"expression": "Playful, alluring",
"gaze": "right eye direct to camera",
"eye_action": "winking with the left eye",
"skin_tone": "Very pale porcelain (lightened aesthetic)",
"makeup": "Stylized K-Beauty/Douyin look: flawless matte base, strong pink blush high on cheeks, pink gradient lips, defined brows, light eyeliner, emphasized Aegyo-sal",
"features": "small beauty mark/mole under the left eye"
},
"hair": {
"length": "long",
"style": "messy high updo/bun with loose strands and curtain bangs",
"color": "dark brown"
},
"wardrobe": {
"top": "white fitted cropped camisole",
"outerwear": "light gray zip hoodie, worn open and slightly slipping off both shoulders",
"bottom": "white lounge shorts with drawstring",
"footwear": "barefoot"
}
},
"environment": {
"location": "studio or minimalist interior",
"set": "black leather sofa against a plain white or light gray wall",
"props": "Silver laptop (Apple MacBook, logo visible) placed on the cushion to the subject's right (camera left)"
},
"lighting": {
"key": {
"source": "strobe/flash",
"modifier": "Bare reflector or direct flash (hard source)",
"position": "Near camera axis, slightly camera-right and above eye line",
"effect": "Crisp, dark, well-defined cast shadows on the wall directly behind subject; strong specular highlights on skin and sofa leather."
},
"fill": {
"type": "minimal/none"
},
"ambient": "suppressed",
"white_balance_K": 5800
},
"camera": {
"system": "Digital Camera",
"sensor": "full-frame equivalent",
"lens": {
"type": "prime",
"focal_length_mm": 50
},
"exposure": {
"iso": 100,
"aperture_f": 4.0,
"metering": "Bright exposure, high-key aesthetic"
},
"focus": {
"target": "near eye (right eye)",
"depth_of_field": "moderate"
},
"framing": {
"orientation": "vertical",
"crop": "mid-thigh to head with room above hair",
"angle": "eye-level",
"composition": "subject centrally framed"
}
},
"color_grade": {
"look": "Bright, clean, slightly cool tone",
"contrast": "High contrast",
"saturation": "moderate, emphasized pinks"
},
"postprocess": {
"noise_reduction": "high",
"texture": "Highly smoothed skin, poreless appearance ('porcelain doll' or 'beauty filter' effect)",
"sharpen": "selective on eyes/lashes",
"blemish_control": "Complete removal of all blemishes and texture."
},
"quality_targets": [
"accurate limb lengths and joint angles",
"correct finger count and articulation",
"realistic fabric tension and folds",
"accurate winking expression"
],
"negative_prompt": [
"no altered or exaggerated body proportions",
"no extra or fused fingers",
"no realistic skin texture, pores, or blemishes",
"no text or watermarks (excluding specified logos)",
"no extreme wide-angle distortion",
"no NSFW content",
"no dark/moody lighting",
"no warm tones"
]
}

中文版

{ “style”: “高调的工作室人像,直接闪光美学,东亚社交媒体风格(如,Ulzzang、抖音),风格化的美颜修饰。 “输出”: { “color_profile”: “sRGB”, “render_intent”: “照片” }, “主题”: { “category”: “人类”, “gender_presentation”: “女性”, “ethnicity”: “东亚人(例如,韩国人、中国人)”, “age_bracket”: “young_adult”, “正文”: { “build”: “苗条”, “proportions”: “自然人体解剖学”, “posture”:“放松在沙发上,随意坐着”, “pose”: “坐着,双腿交叉并靠近身体”, “gesture”: “右手抬起,手指松散卷曲,手指背/指关节轻轻支撑下巴和下脸颊。 “head_tilt_deg”:5 }, “脸”: { “expression”:“俏皮,诱人”, “gaze”:“右眼直视镜头”, “eye_action”:“用左眼眨眼”, “skin_tone”:“非常苍白的瓷器(轻巧的美学)”, “makeup”: “风格化的 K-Beauty/抖音妆容:完美无瑕的哑光底妆,高高的脸颊上浓郁的粉红色腮红,粉红色渐变的嘴唇,轮廓分明的眉毛,浅色眼线,强调爱娇萨尔”, “features”: “左眼下方的小美痕/痣” }, “头发”: { “length”: “long”, “style”: “凌乱的高髻/发髻,松散的股线和窗帘刘海”, “color”: “深棕色” }, “衣柜”: { “top”: “白色合身短款吊带背心”, “outerwear”: “浅灰色拉链连帽衫,敞开穿着,双肩略微滑落”, “bottom”: “带抽绳的白色休闲短裤”, “footwear”: “赤脚” } }, “环境”: { “location”: “工作室或极简主义室内”, “set”: “黑色真皮沙发靠在纯白色或浅灰色的墙壁上”, “props”: “银色笔记本电脑(Apple MacBook,徽标可见)放置在拍摄对象右侧(相机左侧)的垫子上” }, “照明”: { “键”: { “source”: “频闪/闪光灯”, “modifier”: “裸反射器或直接闪光灯(硬源)”, “position”: “靠近相机轴,略微向右和视线上方”, “effect”: “清晰、黑暗、轮廓分明的阴影投射到拍摄对象正后方的墙壁上;皮肤和沙发皮革上强烈的镜面高光。 }, “填充”: { “type”: “最小/无” }, “ambient”: “抑制”, “white_balance_K”:5800 }, “相机”: { “system”: “数码相机”, “sensor”: “全画幅等效”, “镜头”: { “type”: “prime”, “focal_length_mm”:50 }, “曝光”: { “iso”:100, “aperture_f”: 4.0, “metering”:“明亮曝光,高调审美” }, “焦点”: { “target”: “近眼(右眼)”, “depth_of_field”: “中等” }, “框架”: { “orientation”: “垂直”, “crop”: “从大腿中部到头部,头发上方有空间”, “angle”: “视线水平”, “composition”: “主题集中框” } }, “color_grade”: { “look”:“明亮、干净、略带冷色调”, “contrast”: “高对比度”, “saturation”: “适度、强调粉红色” }, “后处理”: { “noise_reduction”: “高”, “texture”: “高度光滑的皮肤,无毛孔的外观('瓷娃娃'或'美颜滤镜'效果)”, “sharpen”:“选择性地涂抹眼睛/睫毛”, “blemish_control”: “彻底去除所有瑕疵和纹理。 }, “quality_targets”: [ “准确的肢体长度和关节角度”, “正确的手指计数和发音”, “逼真的织物张力和褶皱”, “准确的眨眼表情” ], “negative_prompt”: [ “没有改变或夸张的身体比例”, “没有多余的或融合的手指”, “没有逼真的皮肤纹理、毛孔或瑕疵”, “无文字或水印(不包括指定徽标)”, “没有极端的广角畸变”, “没有 NSFW 内容”, “没有黑暗/喜怒无常的灯光”, “没有暖色调” ] }

结论

         Gemini 2.5 Flash Image给我们很多创意空间,只要你感想感做,大部分图像都可以生成。结构化JSON代码容易复用与改造。