🔧 阿川の電商水電行

Shopify 顧問、維護與客製化

💡

小任務 / 單次支援方案

單次處理 Shopify 修正／微調

⭐️

維護方案

每月 Shopify 技術支援 + 小修改 + 諮詢

🚀

專案建置

Shopify 功能導入、培訓 + 分階段交付

👉 瞭解詳情 / 免費諮詢

小編精選 - 技術文章翻譯 · 12月09日

Nano Banana Pro 簡介：完整開發者教程

你喜歡Nano-Banana嗎？用它製作過所有朋友的人偶圖像，以及所有敵人的鬼臉？現在，這款尺寸更大的「 Gemini 3 Pro Image 」機型來了，你們肯定會更喜歡稱它為Nano Banana Pro ！

Flash 版（Nano Banana）以其速度和價格優勢著稱，而 Pro 版則引入了「思考」功能、搜尋功能和高保真 4K 輸出。是時候用它輕鬆應付複雜的創意任務了！

本指南將引導您使用Gemini Developer API了解 Nano Banana Pro 的進階功能。

本指南將涵蓋以下內容：

在 Google AI Studio 中使用 Nano Banana Pro
專案設定
初始化客戶端
基本生成（經典）
「思考」過程
搜尋接地
高解析度 4K 世代
多語言能力
進階影像混合
專業版專屬演示

注意：若要查看此貼文的互動版本，請查看python cookbook或 AI Studio 的Javascript Notebook 。

1) 在 Google AI Studio 中使用 Nano Banana Pro

雖然最終用戶可以透過Gemini 應用程式存取 Nano Banana Pro，但對於開發者而言，進行原型設計和測試的最佳環境是Google AI Studio。 AI Studio 是一個實驗平台，開發者可以在編寫任何程式碼之前體驗所有可用的 AI 模型，它也是使用 Gemini API 進行建置的入口點。

您可以在 AI Studio 中使用 Nano Banana Pro。要開始使用，請存取aistudio.google.com ，使用您的 Google 帳戶登錄，然後從模型選擇器中選擇Nano Banana Pro （Gemini 3 Pro 圖像）。

與 Nano-Banana 不同，專業版沒有免費層級，這表示您需要選擇啟用計費功能的 API 金鑰（請參閱下方的「專案設定」部分）。

在 AI Studio 上開始使用 Nano Banana Pro

提示：您也可以直接在 AI Studio ( ai.studio/apps)中編寫 Nano Banana Web 應用程式，或瀏覽程式碼並重新混合現有應用程式之一。

2）專案設置

要按照本指南操作，您需要以下物品：

來自Google AI Studio 的API 金鑰。
為您的專案設定計費方式。
適用於Python或JavaScript/TypeScript的 Google Gen AI SDK。

如果您已經是 Gemini API 的資深用戶，掌握了以上所有知識，那就太好了！直接跳過本節，進入下一節。否則，以下是入門指南：

步驟 A：取得您的 API 金鑰

首次登入 AI Studio 時，系統會自動建立一個 Google Cloud 專案和一個 API 金鑰。

開啟API 金鑰管理介面，點選「複製」圖示複製您的 API 金鑰。

複製您的 API 金鑰

步驟二：啟用計費功能

由於 Nano Banana Pro 沒有免費套餐，您必須在 Google Cloud 專案中啟用結算功能。

在API 金鑰管理畫面中，按一下專案旁的「設定計費」 ，然後依照螢幕上的指示進行。

設定帳單

Nano Banana Pro 的價格是多少？

使用 Nano Banana Pro 生成圖像比使用 Flash 版本成本更高，尤其是生成 4K 圖像時。截至本文發佈時，生成一張 1K 或 2K 圖像需要花費0.134 美元，而生成一張 4K 圖像需要花費0.24 美元（外加輸入和文字輸出的代幣成本）。

請查看產品文件以取得最新定價詳情。

專業提示：使用批量 API可以節省 50% 的生成成本。但作為交換，您可能需要等待最多 24 小時才能取得影像。

步驟 C：安裝 SDK

選擇您首選語言的 SDK。

Python：

pip install -U google-genai
# Install the Pillow library for image manipulation
pip install Pillow

JavaScript / TypeScript：

npm install @google/genai

注意：以下範例使用 Python SDK 進行示範。使用 Nano Banana 的等效 JavaScript 程式碼片段請參考此JS Notebook 。

3）初始化客戶端

要使用 Pro 型號，您需要使用gemini-3-pro-image-preview型號 ID。

from google import genai
from google.genai import types

# Initialize the client
client = genai.Client(api_key="YOUR_API_KEY")

# Set the model ID
PRO_MODEL_ID = "gemini-3-pro-image-preview"

4）基本生成（經典）

在深入探討高級功能之前，讓我們先來看看一個標準的生成過程。您可以使用response_modalities （取得文字和映像或僅映像）和aspect_ratio來控制輸出。

prompt = "Create a photorealistic image of a siamese cat with a green left eye and a blue right one"
aspect_ratio = "16:9" # "1:1","2:3","3:2","3:4","4:3","4:5","5:4","9:16","16:9" or "21:9"

response = client.models.generate_content(
    model=PRO_MODEL_ID,
    contents=prompt,
    config=types.GenerateContentConfig(
        response_modalities=['Text', 'Image'], # Or just ['Image']
        image_config=types.ImageConfig(
            aspect_ratio=aspect_ratio,
        )
    )
)

# Save the image
for part in response.parts:
    if image:= part.as_image():
        image.save("cat.png")

暹羅貓

聊天模式也是一個選項（實際上，我推薦在進行多輪編輯時使用聊天模式）。可以參考第 8 個範例「Polyglot Banana」。

5）「思考」過程（它是鮮活的！）

Nano Banana Pro 不只是畫圖；它還會思考。這意味著它能夠理解你最複雜、最刁鑽的提示，然後再產生圖像。最棒的是什麼？你可以窺探它的「腦」！

若要啟用此功能，請在thinking_config中設定include_thoughts=True 。

prompt = "Create an unusual but realistic image that might go viral"
aspect_ratio = "16:9"

response = client.models.generate_content(
    model=PRO_MODEL_ID,
    contents=prompt,
    config=types.GenerateContentConfig(
        response_modalities=['Text', 'Image'],
        image_config=types.ImageConfig(
            aspect_ratio=aspect_ratio,
        ),
        thinking_config=types.ThinkingConfig(
            include_thoughts=True # Enable thoughts
        )
    )
)

# Save the image and thoughts
for part in response.parts:
  if part.thought:
    print(f"Thought: {part.text}")
  elif image:= part.as_image():
    image.save("viral.png")

你應該會收到類似這樣的內容：

## Imagining Llama Commuters

I'm focusing on the llamas now. The goal is to capture them as
daily commuters on a bustling bus in La Paz, Bolivia. My plan
involves a vintage bus crammed with amused passengers. The image
will highlight details like one llama looking out the window,
another interacting with a passenger, all while people take
photos.

[IMAGE]

## Visualizing the Concept

I'm now fully immersed in the requested scenario. My primary
focus is on the "unusual yet realistic" aspects. The scene is
starting to take shape with the key elements established.

病毒式傳播的圖片

這種透明度有助於您了解模型如何理解您的要求。就像在和您的藝術家對話一樣！

6）搜尋接地（即時魔法）

其中一項最具變革性的功能是「搜尋即時性」 。 Nano Banana Pro 不拘泥於過去；它可以存取 Google 搜尋的即時資料，產生準確、最新的圖像。想看天氣？沒問題。

例如，您可以讓它顯示目前天氣預報：

prompt = "Visualize the current weather forecast for the next 5 days in Tokyo as a clean, modern weather chart. add a visual on what i should wear each day"

response = client.models.generate_content(
    model=PRO_MODEL_ID,
    contents=prompt,
    config=types.GenerateContentConfig(
        response_modalities=['Text', 'Image'],
        image_config=types.ImageConfig(
            aspect_ratio="16:9",
        ),
        tools=[{"google_search": {}}] # Enable Google Search
    )
)

# Save the image
for part in response.parts:
    if image:= part.as_image():
        image.save("weather.png")

# Display sources (you must always do that)
print(response.candidates[0].grounding_metadata.search_entry_point.rendered_content)

東京天氣

7）要嘛大幹一場，要嘛回家：4K 世代

需要列印級影像？ Nano Banana Pro 支援 4K 解析度。因為有時候，越大越好。

prompt = "A photo of an oak tree experiencing every season"
resolution = "4K" # Options: "1K", "2K", "4K", be careful lower case do not work.

response = client.models.generate_content(
    model=PRO_MODEL_ID,
    contents=prompt,
    config=types.GenerateContentConfig(
        response_modalities=['Text', 'Image'],
        image_config=types.ImageConfig(
            aspect_ratio="1:1",
            image_size=resolution
        )
    )
)

橡樹經歷四季更迭

注意：4K 技術成本較高，請謹慎使用！

8) 多語言香蕉（具備多語言能力）

該模型可以生成圖像中的文本，甚至可以將其翻譯成十幾種語言。它簡直就是你眼睛的通用翻譯器。

# Generate an infographic in Spanish
message = "Make an infographic explaining Einstein's theory of General Relativity suitable for a 6th grader in Spanish"

response = chat.send_message(message,
    config=types.GenerateContentConfig(
        image_config=types.ImageConfig(aspect_ratio="16:9")
    )
)

# Save the image
for part in response.parts:
    if image:= part.as_image():
        image.save("relativity.png")

西班牙語中的廣義相對論

# Translate it to Japanese
message = "Translate this infographic in Japanese, keeping everything else the same"
response = chat.send_message(message)

# Save the image
for part in response.parts:
    if image:= part.as_image():
        image.save("relativity_JP.png")

日文中的廣義相對論

9) 混合搭配！（高級影像混合）

Flash 版最多可混合 3 張圖片，而 Pro 版最多可處理14 張圖片！一次操作即可呈現豐富多彩的內容。非常適合製作複雜的拼貼畫或展示您的全線產品。

# Mix multiple images
response = client.models.generate_content(
    model=PRO_MODEL_ID,
    contents=[
        "An office group photo of these people, they are making funny faces.",
        PIL.Image.open('John.png'),
        PIL.Image.open('Jane.png'),
        # ... add up to 14 images
    ],
)

# Save the image
for part in response.parts:
    if image:= part.as_image():
        image.save("group_picture.png")

圖片描述