v0.9.5
What's Changed
- Update controller grpc by @onur-askui in #94
- feat: set askui controller path directly by @adi-wan-askui in #105
- Fix UnicodeEncodeError by @programminx-askui in #106
- feat: add display management tools and enhance VisionAgent by @danyalxahid-askui in #103
🚀 New Features
-
Computer Tool support of "cursor_position" action":
VisionAgentcan now retrieve the current cursor position, e.g., to answer questions like Is the cursor currently hovering over the star icon button? -
Multiple Display Support: Comprehensive multi-display functionality for computer vision agents
- New Display Management Tools ⟶ e.g.,
VisionAgentsearches now through all available displays if something cannot be found on one:ListDisplaysTool: List all available displays with their propertiesSetActiveDisplayTool: Set the active display for screenshots and actionsRetrieveActiveDisplayTool: Get information about the currently active display
- New Display Management Tools ⟶ e.g.,
-
AskUI Controller Path Configuration: Enhanced flexibility in controller setup
- Direct Path Setting: New
ASKUI_CONTROLLER_PATHenvironment variable for direct controller executable specification - Priority-Based Resolution: Controller path resolution with precedence: direct path > component registry > installation directory
- Cross-Platform Support: Improved path resolution for Windows, macOS, and Linux
- Better Error Handling: Clear error messages for missing or invalid controller paths
- Direct Path Setting: New
-
Modernized GRPC Controller Architecture: Major overhaul of the controller communication system
- JSON Schema Integration: New JSON schema definitions for
AgentOS-Send-Request-2501andAgentOS-Send-Response-2501 - Automated Code Generation:
- New
grpc:genandjson:genPDM scripts for automated code generation - Generated Python classes from JSON schemas using
datamodel-code-generator - Updated GRPC bindings with enhanced type safety
- New
- Enhanced Command Helpers: New
command_helpers.pymodule with functions for:- Mouse position management (
create_get_mouse_position_command,create_set_mouse_position_command) - Render object management (quad, line, image, text commands)
- Styling system with
create_stylefunction supporting CSS-like properties
- Mouse position management (
- Improved Proto Definitions: Updated
Controller_V1.protowith expanded command support
- JSON Schema Integration: New JSON schema definitions for
🔧 Improvements
- Development Workflow Enhancements:
- New Dev Dependency Group: Separated development dependencies (
datamodel-code-generator,grpcio-tools) into dedicated group - Automated Generation Scripts: New
scripts/grpc-gen.shfor GRPC code generation
- New Dev Dependency Group: Separated development dependencies (
🐞 Bug Fixes
- Unicode Encoding: Fixed encoding in chat api persistence layer
🔄 Dependencies
- Moved to Dev Dependencies:
grpcio-tools>=1.73.1: Moved from core to dev dependencies (was>=1.67.0)datamodel-code-generator>=0.31.2: Added for automated code generation
Full Changelog: v0.9.4...v0.9.5