Repository History

Explore all analyzed open source repositories

Topic: Browser Automation
UI-TARS-desktop: The Open-Source Multimodal AI Agent Stack

UI-TARS-desktop: The Open-Source Multimodal AI Agent Stack

UI-TARS-desktop is an open-source multimodal AI Agent stack from ByteDance, designed to connect cutting-edge AI models with agent infrastructure. It provides both Agent TARS, a general multimodal AI agent with CLI and Web UI, and UI-TARS Desktop, a native GUI agent for local and remote computer/browser control. This powerful tool aims to enable human-like task completion through rich multimodal capabilities and seamless integration with real-world tools.

May 6, 2026
View Details
YouTube Transcripts Machine: Extract Timestamps and Transcripts from Videos

YouTube Transcripts Machine: Extract Timestamps and Transcripts from Videos

YouTube Transcripts Machine (YTM) is a web application that automates the extraction of timestamps and transcripts from any YouTube video. It provides a user-friendly interface to view, interact with, and export video transcripts, enhancing accessibility and content analysis for users.

Apr 15, 2026
View Details
Magentic-UI: A Human-Centered AI Agent for Web Automation

Magentic-UI: A Human-Centered AI Agent for Web Automation

Magentic-UI is a research prototype from Microsoft that introduces a human-centered AI agent designed to automate complex web and coding tasks. Unlike black-box agents, it prioritizes transparency and user control, revealing its plans, allowing guidance, and seeking approval for sensitive operations. This innovative system empowers users to automate workflows while maintaining oversight and intervention capabilities.

Mar 30, 2026
View Details
vibe-tools: Empowering AI Agents with Teams and Advanced Skills

vibe-tools: Empowering AI Agents with Teams and Advanced Skills

vibe-tools is a powerful CLI designed to enhance AI agents by providing them with an AI team and advanced skills. It integrates tools like Perplexity for web research, Gemini for repository context, and Stagehand for browser automation. Optimized for Cursor Composer Agent, vibe-tools can be utilized by any coding agent capable of executing commands.

Jan 12, 2026
View Details
Page 1