[Image Generation] Explore Nano Banana Pro capabilities for improved visual output #31

@madjin


Summary

Research into Nano Banana Pro (Gemini 3 Pro Image) turned up several capabilities that could benefit our poster generation pipeline. This issue tracks exploration of those capabilities.

Resources

Capabilities Worth Exploring

Feature | Potential Use Case
Google Search grounding | Accurate brand/project logos, current events imagery
Thinking mode | Complex multi-element compositions
Multi-turn editing | Iterative style refinement
Text rendering | Infographic-style posters, data visualizations
Transparent backgrounds | Cleaner icon/asset generation

Prompting Patterns to Test

From awesome-nanobanana-pro, these techniques show strong results:

  1. Narrative paragraph prompts vs keyword lists
  2. Camera/lens specifications for consistent rendering
  3. Lighting descriptions for photorealistic outputs
  4. Era-specific aesthetic triggers
  5. Explicit composition guidance
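
As a rough illustration of pattern 1 combined with patterns 2 to 5, the sketch below renders the same scene description both as a keyword list and as a narrative paragraph. The `SceneSpec` fields, the helper names, and the sample values are all hypothetical; none of them come from the existing scripts or from awesome-nanobanana-pro.

```python
from dataclasses import dataclass


@dataclass
class SceneSpec:
    """Hypothetical container for the prompt elements listed above."""
    subject: str
    camera: str       # pattern 2: camera/lens specification
    lighting: str     # pattern 3: lighting description
    era: str          # pattern 4: era-specific aesthetic
    composition: str  # pattern 5: explicit composition guidance


def keyword_prompt(spec: SceneSpec) -> str:
    """Baseline: the elements joined as a comma-separated keyword list."""
    return ", ".join(
        [spec.subject, spec.camera, spec.lighting, spec.era, spec.composition]
    )


def narrative_prompt(spec: SceneSpec) -> str:
    """Pattern 1: the same elements written as a short paragraph."""
    return (
        f"A poster showing {spec.subject}. "
        f"The scene is shot with {spec.camera} and lit by {spec.lighting}. "
        f"The overall look follows a {spec.era} aesthetic. "
        f"Composition: {spec.composition}."
    )


# Sample values are made up for illustration only.
spec = SceneSpec(
    subject="a robot assembling a code editor window",
    camera="a 35mm lens at f/2.8",
    lighting="soft window light from the left",
    era="1970s print advertisement",
    composition="subject centered with title space in the top third",
)

if __name__ == "__main__":
    print(keyword_prompt(spec))
    print(narrative_prompt(spec))
```

Keeping both renderings behind one spec means a comparison run changes only the prompt style, not the scene content.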

Exploration Tasks

  • Test Google Search grounding for brand logo accuracy
  • Compare narrative vs structured prompts for our use cases
  • Experiment with thinking mode for complex scenes
  • Test multi-turn editing for style iteration
  • Document what works best for our content (ElizaOS dev updates)
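
One possible way to keep these tasks comparable is to enumerate the runs up front and record a rating per run. The sketch below is only a bookkeeping aid: the axis values, record fields, and function names are assumptions, it makes no API calls, and some feature/subject pairings may not be worth running.

```python
import itertools
import json

# Hypothetical experiment axes derived from the task list above.
PROMPT_STYLES = ["narrative", "structured"]
FEATURES = ["search_grounding", "thinking_mode", "multi_turn_editing"]
SUBJECTS = ["brand logo", "complex scene", "ElizaOS dev update poster"]


def build_test_matrix(styles, features, subjects):
    """Return one record per (style, feature, subject) combination."""
    combos = itertools.product(styles, features, subjects)
    return [
        {
            "id": i,
            "prompt_style": style,
            "feature": feature,
            "subject": subject,
            "rating": None,  # filled in by hand after reviewing the output
            "notes": "",
        }
        for i, (style, feature, subject) in enumerate(combos, start=1)
    ]


def summarize(records):
    """Average the filled-in ratings per prompt style; unrated runs are skipped."""
    ratings = {}
    for rec in records:
        if rec["rating"] is None:
            continue
        ratings.setdefault(rec["prompt_style"], []).append(rec["rating"])
    return {style: sum(vals) / len(vals) for style, vals in ratings.items()}


matrix = build_test_matrix(PROMPT_STYLES, FEATURES, SUBJECTS)

if __name__ == "__main__":
    # Print the first two planned runs as a quick sanity check.
    print(json.dumps(matrix[:2], indent=2))
```

The filled-in records can then be dumped to JSON and attached to this issue as the documentation of what worked best.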

Related Files

  • scripts/posters/generate-icons.py - Icon generation
  • scripts/posters/generate-ai-image.py - Daily poster generation
  • scripts/posters/config/style-presets.json - Existing style templates

Priority: Low - exploratory research for future improvements
