Overview

Tool Name

image_tools

Purpose

The image_tools group provides AI vision and image generation capabilities. You can ask natural language questions about images, create new images from prompts, and locate or retrieve image files that live in your workspace.

Key Features & Functions

Visual Analysis

Ask questions about charts, documents, UI screenshots, and scenes to extract text, structure, and context.

Text-to-Image

Generate illustrations, diagrams, and mockups from concise prompts with optional constraints.

File Management

Find and fetch images by name or path for reuse in workflows.

Batch Friendly

Process multiple images in sequence for consistent analysis at scale.

Input Parameters for Each Function

_image_analysis

Parameters
Name | Definition | Format
file_name | Full local path to the image to analyze. | String
query | Natural language question about the image content. | String

ImageAnalysis (alternative analysis with OpenAI file id support)

Parameters
Name | Definition | Format
file_name | Full local path to the image to analyze. | String
openai_file_id | OpenAI file id of the image if already uploaded. | String
query | Natural language question about the image content. | String

Be explicit in your questions.
Example: “What is the Q2 total in the bottom-right bar and what color is that bar?”
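
As a minimal sketch of how the analysis parameters listed above might be assembled, the helper below builds an argument dictionary and prefers openai_file_id when one is supplied. The helper name build_analysis_args, the dictionary shape, and the example path are assumptions for illustration. This page does not say how the functions are invoked or whether file_name and openai_file_id may be combined, so the sketch sends only one of them.

```python
from typing import Optional


def build_analysis_args(
    query: str,
    file_name: Optional[str] = None,
    openai_file_id: Optional[str] = None,
) -> dict:
    """Assemble arguments for an analysis call (hypothetical helper).

    Only the parameter names come from this page; the dict shape
    is an assumption.
    """
    if not query or not query.strip():
        raise ValueError("query must be a non-empty question")
    if openai_file_id:
        # Assumption: send the file id alone when the image is already uploaded.
        return {"openai_file_id": openai_file_id, "query": query.strip()}
    if file_name:
        return {"file_name": file_name, "query": query.strip()}
    raise ValueError("provide file_name or openai_file_id")


args = build_analysis_args(
    "What is the Q2 total in the bottom-right bar and what color is that bar?",
    file_name="/data/charts/q2.png",  # hypothetical path
)
print(args)
```

Rejecting an empty query early keeps vague or missing questions from reaching the model at all.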

image_generation

Parameters
Name | Definition | Format
prompt | Text description of the image to create. | String
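
Because image_generation takes a single prompt string, it can help to build that string from named parts. The sketch below is one possible way to do so. The helper name build_image_prompt and the "style:" and "composition:" labels are assumptions, not a format the tool is documented to require.

```python
def build_image_prompt(subject, style=None, composition=None, constraints=()):
    """Join prompt parts into one concise string (hypothetical helper)."""
    parts = [subject.strip()]
    if style:
        parts.append(f"style: {style.strip()}")
    if composition:
        parts.append(f"composition: {composition.strip()}")
    # Constraints such as aspect ratio go last so they are easy to spot.
    parts.extend(c.strip() for c in constraints if c.strip())
    return "; ".join(parts)


prompt = build_image_prompt(
    "hero image of a cargo ship at dawn",
    style="flat vector illustration",
    composition="ship centered, wide horizon",
    constraints=["16:9 aspect ratio", "no text"],
)
# Only the "prompt" key is taken from the parameter list above.
print({"prompt": prompt})
```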

find_image_path_in_download_folder

Parameters
Name | Definition | Format
file_name | File name to locate in the downloads. | String
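
To make the lookup behavior concrete, here is a rough local analogue written with Python's pathlib. It is not the tool's implementation. The function name find_in_folder, the recursive search, and the choice to return the first match in sorted order are all assumptions.

```python
import tempfile
from pathlib import Path
from typing import Optional


def find_in_folder(folder: str, file_name: str) -> Optional[str]:
    """Return the first file under `folder` whose name equals `file_name`."""
    root = Path(folder).expanduser()
    if not root.is_dir():
        return None
    # Compare names exactly instead of globbing, so characters like "[" are safe.
    matches = sorted(p for p in root.rglob("*") if p.is_file() and p.name == file_name)
    return str(matches[0]) if matches else None


# Demonstrate on a throwaway folder standing in for a downloads directory.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "sub").mkdir()
    (Path(tmp) / "sub" / "chart.png").write_bytes(b"")
    print(find_in_folder(tmp, "chart.png"))
    print(find_in_folder(tmp, "missing.png"))
```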

GetImage

Parameters
Name | Definition | Format
file_name | Name of the stored image file to retrieve. | String

Use Cases

  1. Chart understanding: Extract totals, trends, or anomalies from screenshots of dashboards.
  2. Document capture: Read invoices, forms, and scanned PDFs to pull text and key fields.
  3. Asset creation: Generate icons, diagrams, or hero images for reports and presentations.
  4. Product review: Identify defects or features across batches of product photos.
  5. UX analysis: Describe layout, hierarchy, and accessibility concerns in UI screenshots.

Check licenses and usage rights before generating or distributing images that resemble branded assets, people, or copyrighted material.

Workflow/How It Works

  1. Provide an image or prompt: For analysis, supply a path or file id and a clear question. For generation, write a concise prompt with style or size hints.
  2. Run the function: The tool routes to a vision or image model, processes your request, and returns structured text or a new image artifact.
  3. Store and reuse: Register generated images with your file manager and attach them to tasks or projects. Retrieve them later by name or id.

For consistent results across a set, script a loop that calls the same analysis query or prompt while iterating over file names.
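
The batch loop described above can be sketched as follows. The analysis call is passed in as a plain callable because this page does not define how the tool is invoked. analyze_batch and fake_analyze are hypothetical names, and fake_analyze only stands in for whatever wrapper you write around the real analysis function.

```python
from typing import Callable, Dict, Iterable


def analyze_batch(
    file_names: Iterable[str],
    query: str,
    analyze: Callable[[str, str], str],
) -> Dict[str, dict]:
    """Run the same query against each file and collect one outcome per file."""
    results: Dict[str, dict] = {}
    for name in file_names:
        try:
            results[name] = {"ok": True, "answer": analyze(name, query)}
        except Exception as exc:
            # Record the failure and keep going so one bad file does not stop the batch.
            results[name] = {"ok": False, "error": str(exc)}
    return results


def fake_analyze(file_name: str, query: str) -> str:
    """Stand-in for the real analysis call; replace with your own wrapper."""
    if not file_name.lower().endswith((".png", ".jpg", ".jpeg")):
        raise ValueError(f"unsupported format: {file_name}")
    return f"answer for {file_name}"


report = analyze_batch(
    ["a.png", "b.jpg", "notes.txt"],
    "List any visible defects.",
    fake_analyze,
)
for name, outcome in report.items():
    print(name, outcome)
```

Holding the query fixed and varying only the file name is what keeps the answers comparable across the set.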

Integration Relevance

  • file_manager_tools to register, attach, and share images with entities.
  • artifact_manager_tools to persist and link generated outputs.
  • project_manager_tools to track visual assets by mission or task.
  • document_index_tools to index extracted text from image-based documents.
  • git_action for versioned storage of static image assets.

Configuration Details

  • Use common formats like PNG or JPEG for best compatibility.
  • Paths must be accessible from the runtime environment.
  • Prompts should describe subject, style, composition, and any constraints such as aspect ratio.
  • When available, prefer openai_file_id for images already uploaded to a model store.
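
A small pre-submission check can catch the first two points before any call is made. This is a sketch under stated assumptions: the function name preflight is hypothetical, and the allowed suffix set covers only PNG and JPEG because those are the formats this page names as examples. The tool may accept others.

```python
from pathlib import Path

# Assumption: PNG and JPEG only; the page lists them as examples, not as a full set.
COMMON_SUFFIXES = {".png", ".jpg", ".jpeg"}


def preflight(path: str) -> list:
    """Return a list of problems found before submitting an image path."""
    problems = []
    p = Path(path)
    if p.suffix.lower() not in COMMON_SUFFIXES:
        problems.append(f"uncommon format: {p.suffix or '(none)'}")
    if not p.is_file():
        # Checked from this process, which may differ from the tool's runtime.
        problems.append("path does not point to an existing file")
    return problems


print(preflight("/no/such/dir/scan.tiff"))
```

An empty list means both checks passed from the calling process. It does not guarantee that the tool's own runtime can reach the same path.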

Limitations or Notes

  1. Analysis quality depends on resolution and clarity of the source image.
  2. Very small text or dense tables may require higher resolution inputs.
  3. Generated images may vary from the prompt when instructions are vague.
  4. Large images or large batches can increase processing time and memory usage.
  5. Corrupted or unsupported files cause the call to return a detailed error message.

Output

  • Image Analysis: Descriptions of objects, text, layout, colors, and relationships. May include extracted text and confidence indicators.
  • Image Generation: New image files with artifact ids and links for download or attachment.
  • File Location and Retrieval: Resolved paths for found files and direct content for requested images.
  • Errors: Clear messages for missing files, unsupported formats, or processing failures.