Skip to content

Releases: askui/python-sdk

v0.12.1

15 Aug 13:54

Choose a tag to compare

What's Changed

🚀 Features

  • feat: AgentBase.locate_all() to locate (points) of all element matching a locator by @mlikasam-askui in #118

🐛 Bug Fixes

Full Changelog: v0.12.0...v0.12.1

v0.12.0

15 Aug 09:20

Choose a tag to compare

What's Changed

🚀 Features

  • feat(get): support PDF processing and refactor image handling by @danyalxahid-askui in #115
    • Updated the get method to accept both images and PDFs as sources (size up to 20MB)
    • Renamed the image parameter to source for clarity
    • Added support for PDF processing in ASKUI-hosted gemini models (default)

🐛 Bug Fixes

  • fix: change ModelName to a regular class so that constants can be used like regular strings in Python >3.10 by @adi-wan-askui in #116
    • Fixes issues with using Gemini where "ModelName.GEMINI__2_5__FLASH" was passed as model id instead of "gemini-2.5-flash"
  • fix: encoding issue by @mlikasam-askui in #117

🚨 Breaking Changes

  • Renamed parameter of AgentBase.get() method from image to source
  • Data URL (source of AgentBase.get()) now requires the mimetype to be specified (e.g., "data:application/pdf;base64,JVBERi...")

Full Changelog: v0.11.0...v0.12.0

v0.11.0

11 Aug 09:11
7ef6439

Choose a tag to compare

What's Changed

• Re release of the previous release 0.10.5 as a major upgrade because of breaking changes

Full Changelog: v0.10.5...v0.11.0

v0.10.5

11 Aug 08:17

Choose a tag to compare

What's Changed

🐞 Bug Fixes

  • Issue with Tars Model Initialization

🚨 Breaking Changes

  • UI-TARS: If you want to use UI-TARS, it is from now on required to provide a model name using the env variable TARS_MODEL_NAME.

Full Changelog: v0.10.4...v0.10.5

v0.10.4

07 Aug 09:28
18f7bd2

Choose a tag to compare

What's Changed

🐞 Bug Fixes

Full Changelog: v0.10.3...v0.10.4

v0.10.3

01 Aug 09:45

Choose a tag to compare

What's Changed

  • refactor(gemini): retry on more error codes & interpret retry-after header by @adi-wan-askui in #112

Full Changelog: v0.10.2...v0.10.3

v0.10.2

31 Jul 16:34

Choose a tag to compare

🐞 Bug Fixes

  • Optional dependencies (regression): Fixed that importing VisionAgent failed when installing askui via pip install askui as optional dependencies were imported although not available (made fastapi a required dependency as some tools require it)

Full Changelog: v0.10.1...v0.10.2

v0.10.1

31 Jul 16:20

Choose a tag to compare

What's Changed

🐞 Bug Fixes

  • Optional dependencies: Fixed that importing VisionAgent failed when installing askui via pip install askui as optional dependencies were imported although not available
  • Stability: Made AskUI model usage more stable, 502 errors from API are now also retried and retry logic is improved with it so we retry 3 times (30, 60, 120 s)

Full Changelog: v0.10.0...v0.10.1

v0.10.0

30 Jul 15:03

Choose a tag to compare

What's Changed

🚀 New Features

  • Google Gemini API Support: The askui model now uses gemini-2.5-flash as the default model, falling back to the original askui model (Inference API's VQA endpoint) if the Google GenAI API fails, e.g., because of missing support of schema or for unknown reason. For example, Google GenAI API does not support recursive schemas at the moment.
  • New Model Options: askui/gemini-2.5-flash and askui/gemini-2.5-pro are now supported as model choices.

🚨 Breaking Changes

  • Default Model Change: The askui default model for AgentBase.get() (and, therefore, VisionAgent.get() etc.) has changed, which may affect the behavior of existing implementations.

Full Changelog: v0.9.7...v0.10.0

v0.9.7

30 Jul 08:27

Choose a tag to compare

What's Changed

Rerelease of v0.9.6 due to a problem while releasing.

Full Changelog: v0.9.6...v0.9.7