A GUI Agent application based on UI-TARS (Vision-Language Model) that allows you to control your computer using natural language.
Updated Mar 15, 2025 - TypeScript
Reliable Automation Agents at Scale
PsyDI: Towards a Personalized and Progressively In-depth Chatbot for Psychological Measurements (e.g., an MBTI Measurement Agent).
This repository contains the frontend code for Ailert.tech, built with Next.js, Tailwind CSS, and Python.
An AI-powered location discovery system using multi-modal data (text, images, reviews, real-time factors)