Repository History
Explore all analyzed open-source repositories

UI-TARS-desktop: The Open-Source Multimodal AI Agent Stack
UI-TARS-desktop is an open-source multimodal AI agent stack from ByteDance, designed to connect AI models with agent infrastructure. It provides both Agent TARS, a general multimodal AI agent with a CLI and Web UI, and UI-TARS Desktop, a native GUI agent for local and remote computer and browser control. It aims to complete tasks the way a human would, by combining multimodal capabilities with integration into real-world tools.

YouTube Transcripts Machine: Extract Timestamps and Transcripts from Videos
YouTube Transcripts Machine (YTM) is a web application that automates the extraction of timestamps and transcripts from YouTube videos. It provides an interface to view, interact with, and export video transcripts, which supports accessibility and content analysis.

Magentic-UI: A Human-Centered AI Agent for Web Automation
Magentic-UI is a research prototype from Microsoft: a human-centered AI agent designed to automate complex web and coding tasks. Unlike black-box agents, it prioritizes transparency and user control by revealing its plans, accepting guidance, and seeking approval for sensitive operations. Users can automate workflows while retaining the ability to oversee and intervene.

vibe-tools: Empowering AI Agents with Teams and Advanced Skills
vibe-tools is a CLI designed to enhance AI agents by providing them with an AI team and advanced skills. It integrates tools such as Perplexity for web research, Gemini for repository context, and Stagehand for browser automation. Although optimized for Cursor Composer Agent, vibe-tools can be used by any coding agent that can execute commands.