{"name":"UFO: Microsoft's Multi-Device AI Agent Orchestration Framework","description":"Microsoft's UFO project introduces a powerful framework for intelligent automation, evolving from a robust Windows Desktop AgentOS (UFO²) to a revolutionary Multi-Device Agent Galaxy (UFO³). This project enables the orchestration of AI agents across diverse platforms, streamlining complex workflows and enhancing digital interaction. It offers both standalone Windows automation and a scalable solution for cross-device collaboration.","github":"https://github.com/microsoft/UFO","url":"https://osrepos.com/repo/microsoft-ufo","source":"osrepos.com","sourceDescription":"This repository profile is provided by osrepos.com, an open source repository discovery platform.","repositoryProfile":"https://osrepos.com/repo/microsoft-ufo","generatedFor":"open source discovery and AI-assisted research","markdown":"https://osrepos.com/repo/microsoft-ufo.md","json":"https://osrepos.com/repo/microsoft-ufo.json","topics":["agent","automation","llm","windows","python","multi-device","orchestration","ai"],"keywords":["agent","automation","llm","windows","python","multi-device","orchestration","ai"],"stars":null,"summary":"Microsoft's UFO project introduces a powerful framework for intelligent automation, evolving from a robust Windows Desktop AgentOS (UFO²) to a revolutionary Multi-Device Agent Galaxy (UFO³). This project enables the orchestration of AI agents across diverse platforms, streamlining complex workflows and enhancing digital interaction. It offers both standalone Windows automation and a scalable solution for cross-device collaboration.","content":"## Introduction\n\nUFO, presented in its latest iteration as \"UFO³: Weaving the Digital Agent Galaxy,\" is an open-source project from Microsoft for intelligent automation. 
Written primarily in Python, UFO has over 8,100 stars and 1,000 forks on GitHub.\n\nThe project has evolved through three major phases:\n*   **UFO (February 2024):** The original UI-Focused agent for Windows.\n*   **UFO² (April 2025):** Evolved into a Desktop AgentOS, offering stable and battle-tested single-device Windows automation.\n*   **UFO³ Galaxy (November 2025):** The latest and most advanced iteration, introducing a multi-device orchestration framework capable of coordinating intelligent agents across heterogeneous platforms.\n\nUFO³ combines the new Galaxy framework for multi-device orchestration with the proven capabilities of UFO² as a robust Windows device agent. This allows users to tackle everything from simple desktop tasks to complex, cross-device workflows.\n\n## Installation\n\nGetting started with UFO involves choosing between UFO³ Galaxy for multi-device orchestration and UFO² for Windows-specific automation. Both require Python and LLM API configuration.\n\n**UFO³ Galaxy Quick Start (For cross-device orchestration)**\n\n1.  **Install Dependencies:**\n    ```powershell\n    pip install -r requirements.txt\n    ```\n2.  **Configure ConstellationAgent:**\n    ```powershell\n    copy config\\galaxy\\agent.yaml.template config\\galaxy\\agent.yaml\n    ```\n    Edit `config\\galaxy\\agent.yaml` to add your API keys (e.g., OpenAI or Azure OpenAI).\n3.  **Configure Devices:**\n    Edit `config\\galaxy\\devices.yaml` to register your devices (Windows, Linux, Android).\n4.  **Start Device Agents:**\n    Follow platform-specific guides to start server and client components for each device.\n5.  **Launch Galaxy:**\n    ```powershell\n    python -m galaxy --interactive\n    ```\n\n**UFO² Quick Start (For Windows automation)**\n\n1.  **Install Dependencies:**\n    ```powershell\n    pip install -r requirements.txt\n    ```\n2.  
**Configure LLMs:**\n    ```powershell\n    copy config\\ufo\\agents.yaml.template config\\ufo\\agents.yaml\n    ```\n    Edit `config\\ufo\\agents.yaml` to add your API keys.\n3.  **Run UFO²:**\n    ```powershell\n    python -m ufo --task <task_name>\n    ```\n\n**Common LLM Configuration**\n\nBoth frameworks require LLM API configuration. Here's an example for OpenAI:\n\n**For Galaxy (`config/galaxy/agent.yaml`):**\n```yaml\nCONSTELLATION_AGENT:\n  REASONING_MODEL: false\n  API_TYPE: \"openai\"\n  API_BASE: \"https://api.openai.com/v1/chat/completions\"\n  API_KEY: \"sk-your-key-here\"\n  API_MODEL: \"gpt-4o\"\n```\n\n**For UFO² (`config/ufo/agents.yaml`):**\n```yaml\nVISUAL_MODE: True\nAPI_TYPE: \"openai\"\nAPI_BASE: \"https://api.openai.com/v1/chat/completions\"\nAPI_KEY: \"sk-your-key-here\"\nAPI_MODEL: \"gpt-4o\"\n```\n\nMore LLM options (Qwen, Gemini, Claude) are available in the [official documentation](https://microsoft.github.io/UFO/supported_models/overview/){:target=\"_blank\"}.\n\n## Examples\n\nUFO³ Galaxy orchestrates complex workflows across multiple devices by breaking tasks down into executable DAGs (directed acyclic graphs) and coordinating agents on different platforms. For instance, it can manage a task that requires data extraction on a Linux server, processing on a Windows machine, and final reporting on a mobile device.\n\nUFO², on the other hand, is optimized for single-device Windows automation, performing tasks like interacting with GUI elements, automating application workflows, and integrating with native Windows OS functionality. 
It can also serve as a device agent within the larger UFO³ Galaxy framework.\n\nYou can watch UFO³ Galaxy orchestrating cross-device tasks on the project's [official YouTube channel](https://www.youtube.com/watch?v=NGrVWGcJL8o){:target=\"_blank\"}.\n\n## Why Use UFO?\n\nUFO offers a versatile solution for intelligent automation, catering to different needs:\n\n**UFO³ Multi-Device Agent Galaxy (New & Recommended):**\n*   **Cross-device collaboration:** Ideal for workflows spanning multiple operating systems and devices.\n*   **Complex multi-step automation:** Handles intricate tasks with DAG-based orchestration.\n*   **Heterogeneous platform integration:** Supports Windows, Linux, Android, and more.\n*   **Dynamic DAG editing:** Adapts workflows based on execution feedback.\n*   **Unified AIP protocol:** Ensures secure and fault-tolerant agent communication.\n\n**UFO² Desktop AgentOS (Stable & Battle-Tested):**\n*   **Single-device Windows automation:** Perfect for desktop-specific tasks.\n*   **Quick task execution:** Streamlined for rapid automation.\n*   **Deep Windows OS integration:** Leverages UIA, Win32, and WinCOM for robust control.\n*   **Hybrid GUI + API actions:** Combines visual interaction with programmatic calls for efficiency.\n*   **Long-Term Support (LTS):** Actively maintained with ongoing bug fixes and improvements.\n\nBecause UFO² can run as a Windows device agent within the UFO³ Galaxy framework, it gives users a migration path for scaling their automation from a single desktop to multiple devices.\n\n## Links\n\n*   **GitHub Repository:** [https://github.com/microsoft/UFO](https://github.com/microsoft/UFO){:target=\"_blank\"}\n*   **Full Documentation:** [https://microsoft.github.io/UFO/](https://microsoft.github.io/UFO/){:target=\"_blank\"}\n*   **UFO³ Galaxy Quick Start:** [https://microsoft.github.io/UFO/getting_started/quick_start_galaxy/](https://microsoft.github.io/UFO/getting_started/quick_start_galaxy/){:target=\"_blank\"}\n*   **UFO² 
Documentation:** [https://github.com/microsoft/UFO/blob/main/ufo/README.md](https://github.com/microsoft/UFO/blob/main/ufo/README.md){:target=\"_blank\"}\n*   **YouTube Channel:** [https://www.youtube.com/watch?v=NGrVWGcJL8o](https://www.youtube.com/watch?v=NGrVWGcJL8o){:target=\"_blank\"}\n*   **GitHub Discussions:** [https://github.com/microsoft/UFO/discussions](https://github.com/microsoft/UFO/discussions){:target=\"_blank\"}\n*   **Issue Tracker:** [https://github.com/microsoft/UFO/issues](https://github.com/microsoft/UFO/issues){:target=\"_blank\"}","metrics":{"detailViews":5,"githubClicks":5},"dates":{"published":null,"modified":"2026-03-20T16:31:33.000Z"}}