refactor ui-tar-api by mlikasam-askui · Pull Request #44 · askui/python-sdk

mlikasam-askui · 2025-04-24T08:58:14Z

No description provided.

mlikasam-askui · 2025-04-24T09:06:21Z

The timeouts are magic numbers 🤣 carried over from the old code.
Feel free to change or close the PR if it doesn't seem needed.

programminx-askui

Did you try it without time.sleep?

And can we update the README.md to say that we now support UI-TARS 1.5?

adi-wan-askui · 2025-04-28T07:17:10Z

src/askui/models/ui_tars_ep/parser.py

+    def execute(self, _agent_os: AgentOs) -> None:
+        raise Exception("Call user action executed. This should be handled by the agent's logic, not directly here.")


Just thinking out loud about where we could take this in the future (not now): if we handed over other tools instead of only the AgentOs, we would be able to handle this here. Examples would be a tool that gets data from the user, e.g., through a console, or one that hands control over to the user and busy-waits until the user confirms that they have helped out (potentially with an explanation of how they helped).
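A minimal sketch of that idea, not the SDK's actual design: `Tools`, `ask_user`, and the `reason` field are hypothetical names introduced here, and `AgentOs` is only a typing stand-in. It blocks on a console-style prompt rather than busy-waiting.

```python
from dataclasses import dataclass
from typing import Callable, Protocol


class AgentOs(Protocol):
    """Typing stand-in for the SDK's AgentOs (assumption, no methods needed here)."""


@dataclass
class Tools:
    """Hypothetical bundle handed to every action instead of a bare AgentOs."""

    agent_os: AgentOs
    ask_user: Callable[[str], str]  # e.g. the built-in input() for a console


@dataclass
class CallUserAction:
    """Action emitted when the model needs help from a human."""

    reason: str = "The model requested help."

    def execute(self, tools: Tools) -> str:
        # Block until the user responds; the reply can describe how they helped.
        prompt = f"{self.reason} Describe what you did, then press Enter: "
        return tools.ask_user(prompt)
```

In a console session, `Tools(agent_os=..., ask_user=input)` would wire the prompt to standard input; a test can pass any callable instead.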

adi-wan-askui · 2025-04-28T07:17:23Z

src/askui/models/ui_tars_ep/ui_tars_api.py

        )
        raw_message = chat_completion.choices[-1].message.content
-        print(raw_message)
+        logger.debug(f"Raw message: {raw_message}")


adi-wan-askui · 2025-04-28T07:17:43Z

src/askui/models/ui_tars_ep/ui_tars_api.py

        self.execute_act(self.act_history)

    def add_screenshot_to_history(self, message_history):
+        time.sleep(0.5)


Why was this necessary?

adi-wan-askui · 2025-04-28T07:18:15Z

src/askui/models/ui_tars_ep/ui_tars_api.py

+        if isinstance(action, FinishedAction):
            return
+
+        action.execute(self._agent_os)


Beautiful refactoring :)

Using pydantic for parsing + using the strategy pattern
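For readers unfamiliar with the pattern, here is a rough, stdlib-only sketch of the dispatch shown in the diff. It swaps pydantic for plain dataclasses to stay dependency-free, and `RecordingOs`, `ClickAction`, the `click` method, and `run` are hypothetical names, not the PR's code.

```python
from dataclasses import dataclass
from typing import Any, List, Tuple


class RecordingOs:
    """Fake agent OS that records calls; `click` is an assumed method."""

    def __init__(self) -> None:
        self.calls: List[Tuple[Any, ...]] = []

    def mouse(self, x: int, y: int) -> None:
        self.calls.append(("mouse", x, y))

    def click(self, button: str) -> None:
        self.calls.append(("click", button))


@dataclass
class Action:
    action_type: str

    def execute(self, agent_os: RecordingOs) -> None:
        # Default strategy: subclasses override this.
        raise NotImplementedError(f"Action '{self.action_type}' not implemented yet")


@dataclass
class ClickAction(Action):
    x: int = 0
    y: int = 0

    def execute(self, agent_os: RecordingOs) -> None:
        agent_os.mouse(x=self.x, y=self.y)
        agent_os.click("left")


@dataclass
class FinishedAction(Action):
    """Marker action: the caller stops instead of executing it."""


def run(actions: List[Action], agent_os: RecordingOs) -> int:
    """Execute actions until a FinishedAction appears; return how many ran."""
    executed = 0
    for action in actions:
        if isinstance(action, FinishedAction):
            return executed
        action.execute(agent_os)  # each action carries its own behavior
        executed += 1
    return executed
```

The caller never branches on `action_type`; adding a new action means adding a class with its own `execute`.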

adi-wan-askui · 2025-04-28T07:19:15Z

src/askui/models/ui_tars_ep/parser.py

-class FinishedAction(BaseModel):
-    """Finished action."""
+    def execute(self, _agent_os: AgentOs) -> None:
+        time.sleep(5)


Just thinking out loud about where we could take this in the future (not now): if we handed over other tools instead of only the AgentOs, e.g., one for waiting, we would be able to handle this here.
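One way such a waiting tool could look, purely as an illustration: `Waiter` and `WaitAction` are hypothetical names, and the 5-second default mirrors the value hard-coded in the diff above.

```python
import time
from dataclasses import dataclass
from typing import Callable


@dataclass
class Waiter:
    """Hypothetical waiting tool; `sleep` is injectable so tests need not block."""

    sleep: Callable[[float], None] = time.sleep

    def wait(self, seconds: float) -> None:
        self.sleep(seconds)


@dataclass
class WaitAction:
    seconds: float = 5.0  # same magic number as in the diff

    def execute(self, waiter: Waiter) -> None:
        # The action no longer calls time.sleep itself; the tool decides how to wait.
        waiter.wait(self.seconds)
```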

adi-wan-askui · 2025-04-28T07:23:36Z

src/askui/models/ui_tars_ep/parser.py

-    """Hotkey action with key combination."""
+    def execute(self, agent_os: AgentOs) -> None:
+        agent_os.mouse(x=self.start_box.x, y=self.start_box.y)
+        time.sleep(0.2)


I would move this into the AskUiControllerClient. From my perspective it depends on the underlying AgentOs implementation: it is the general wait time between actions that gives the AgentOs implementation, the OS, the application, etc. time to react, and it is largely model-independent.

I would also make it configurable through the constructor of AskUiControllerClient, since it depends heavily on the OS, the application being automated, etc.

FYI: I just saw that there are already two properties (pre_action_wait and post_action_wait) that may only need their values adjusted and to be exposed through the constructor.
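A sketch of what exposing those waits could look like. `ControllerClientSketch` is a hypothetical stand-in, not the real AskUiControllerClient; the default values, the `sleep` parameter, and `_run_action` are assumptions made for illustration.

```python
import time
from typing import Callable, List, Tuple


class ControllerClientSketch:
    """Hypothetical stand-in showing waits configured via the constructor."""

    def __init__(
        self,
        pre_action_wait: float = 0.0,   # assumed default
        post_action_wait: float = 0.2,  # mirrors the 0.2 s sleep in the diff
        sleep: Callable[[float], None] = time.sleep,
    ) -> None:
        self.pre_action_wait = pre_action_wait
        self.post_action_wait = post_action_wait
        self._sleep = sleep
        self.moves: List[Tuple[int, int]] = []  # records moves instead of driving a real OS

    def _run_action(self, action: Callable[[], None]) -> None:
        # Wait before and after every action so the OS and application can react.
        self._sleep(self.pre_action_wait)
        action()
        self._sleep(self.post_action_wait)

    def mouse(self, x: int, y: int) -> None:
        self._run_action(lambda: self.moves.append((x, y)))
```

With the waits applied in one place, the per-action `time.sleep(0.2)` calls in the parser could be dropped, and slower targets would only need a larger `post_action_wait`.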

adi-wan-askui · 2025-04-28T07:27:39Z

src/askui/models/ui_tars_ep/parser.py

+    action_type: str
+
+    def execute(self, agent_os: AgentOs) -> None:
+        raise NotImplementedError(f"Action '{self.action_type}' must implement execute method.")


Suggested change

-        raise NotImplementedError(f"Action '{self.action_type}' must implement execute method.")
+        raise NotImplementedError(f"Action '{self.action_type}' not implemented yet")

refactor ui-tar-api

f480d79

mlikasam-askui requested a review from adi-wan-askui April 24, 2025 08:58

mlikasam-askui self-assigned this Apr 24, 2025

programminx-askui reviewed Apr 25, 2025

adi-wan-askui suggested changes Apr 28, 2025

mlikasam-askui closed this Jun 20, 2025

mlikasam-askui deleted the fix-ui-tar-api-handler branch August 27, 2025 08:39

mlikasam-askui wants to merge 1 commit into main from fix-ui-tar-api-handler

3 participants
