We are happy to release MMBench-GUI, a hierarchical, multi-platform benchmark framework and toolbox, to evaluate GUI agents. MMBench-GUI is comprising four evaluation levels: GUI Content Understanding ...
Abstract: Addressing the critical challenge of spatiotemporal semantic disjunction caused by conventional bird’s-eye view trajectory modeling methods in ego-vehicle perspective road user behavior ...
Abstract: With the increasing number of people with disabilities and other social problems, brain-computer interface (BCI) plays an increasingly important role in the field of medical rehabilitation.
A Windows desktop application (WinForms) that provides a one-stop GUI for various system maintenance tasks—like SFC scans, DISM checks, clearing temp files/cache, and more. This tool is especially ...
One of the principal challenges in building VLM-powered GUI agents is visual grounding, i.e., localizing the appropriate screen region for action execution based on both the visual content and the ...