{"name":"Phoenix: AI Observability and Evaluation Platform for LLMs","description":"Phoenix is an open-source AI observability platform from Arize AI, designed for comprehensive experimentation, evaluation, and troubleshooting of LLM applications. It provides robust features including OpenTelemetry-based tracing, LLM evaluation, and systematic prompt management. This platform helps developers optimize and debug their AI models effectively across various environments.","github":"https://github.com/Arize-ai/phoenix","url":"https://osrepos.com/repo/arize-ai-phoenix","source":"osrepos.com","sourceDescription":"This repository profile is provided by osrepos.com, an open source repository discovery platform.","repositoryProfile":"https://osrepos.com/repo/arize-ai-phoenix","generatedFor":"open source discovery and AI-assisted research","markdown":"https://osrepos.com/repo/arize-ai-phoenix.md","json":"https://osrepos.com/repo/arize-ai-phoenix.json","topics":["AI Observability","LLM Evaluation","Prompt Engineering","AI Monitoring","LangChain","LlamaIndex","Python","AI Agents"],"keywords":["AI Observability","LLM Evaluation","Prompt Engineering","AI Monitoring","LangChain","LlamaIndex","Python","AI Agents"],"stars":null,"summary":"Phoenix is an open-source AI observability platform from Arize AI, designed for comprehensive experimentation, evaluation, and troubleshooting of LLM applications. It provides robust features including OpenTelemetry-based tracing, LLM evaluation, and systematic prompt management. This platform helps developers optimize and debug their AI models effectively across various environments.","content":"## Introduction\n\nPhoenix is an open-source AI observability platform built by Arize AI, specifically engineered for the experimentation, evaluation, and troubleshooting of large language model (LLM) applications. It offers a comprehensive suite of tools to gain deep insights into your AI systems.\n\nKey capabilities include:\n\n*   **Tracing**: Instrument your LLM application's runtime using OpenTelemetry for detailed visibility.\n*   **Evaluation**: Benchmark application performance with LLM-powered response and retrieval evaluations.\n*   **Datasets**: Create versioned datasets for experimentation, evaluation, and fine-tuning.\n*   **Experiments**: Track and evaluate changes to prompts, LLMs, and retrieval strategies.\n*   **Playground**: Optimize prompts, compare models, adjust parameters, and replay traced LLM calls.\n*   **Prompt Management**: Systematically manage and test prompt changes with version control and tagging.\n*   **PXI (Phoenix Intelligence)**: An integrated AI engineering agent for debugging traces and iterating on prompts.\n\nPhoenix is designed to be vendor and language agnostic, providing out-of-the-box support for popular frameworks like OpenAI Agents SDK, LangChain, LlamaIndex, and DSPy, as well as LLM providers such as OpenAI, Anthropic, and Google GenAI. It runs flexibly on your local machine, in a Jupyter notebook, as a containerized deployment, or in the cloud.\n\n## Installation\n\nGetting started with Phoenix is straightforward. You can install the core package using `pip` or `conda`:\n\nshell\npip install arize-phoenix\n\n\nFor containerized deployments, Phoenix container images are available via [Docker Hub](https://hub.docker.com/r/arizephoenix/phoenix 'Docker Hub' target=\"_blank\") and can be deployed using Docker or Kubernetes. Arize AI also offers cloud instances at [app.phoenix.arize.com](https://app.phoenix.arize.com/ 'Arize AI Cloud' target=\"_blank\").\n\n## Examples\n\nPhoenix offers extensive integration examples across various LLM frameworks and providers, demonstrating its versatility. It supports popular Python frameworks such as OpenAI Agents SDK, LlamaIndex, LangChain, and DSPy, enabling seamless tracing and evaluation within these ecosystems. For JavaScript developers, integrations include the OpenAI Node SDK, LangChain.js, and Vercel AI SDK, ensuring broad compatibility.\n\nBeyond the main platform, Phoenix provides specialized lightweight Python sub-packages like `arize-phoenix-otel` for OpenTelemetry wrappers, `arize-phoenix-client` for API interaction, and `arize-phoenix-evals` for LLM evaluation tooling. Similar TypeScript sub-packages are also available. Additionally, the repository includes coding agent skills for platforms like Claude Code and Cursor, facilitating advanced debugging and evaluation workflows directly within your coding environment.\n\n## Why use Phoenix\n\nPhoenix stands out as a critical tool for anyone developing and deploying LLM applications. Its comprehensive AI observability features, including tracing, evaluation, and dataset management, provide unparalleled visibility into your models' behavior. By being vendor and language agnostic and built on OpenTelemetry, Phoenix ensures maximum flexibility and integration with your existing tech stack. The platform's ability to track and evaluate prompt and model changes through experiments, coupled with its intuitive playground, empowers developers to iterate and optimize their AI systems efficiently. With flexible deployment options and a strong community, Phoenix is an essential asset for debugging, improving, and maintaining robust LLM applications.\n\n## Links\n\n*   **GitHub Repository**: [https://github.com/Arize-ai/phoenix](https://github.com/Arize-ai/phoenix 'GitHub Repository' target=\"_blank\")\n*   **Documentation**: [https://arize.com/docs/phoenix/](https://arize.com/docs/phoenix/ 'Phoenix Documentation' target=\"_blank\")\n*   **Slack Community**: [https://join.slack.com/t/arize-ai/shared_invite/zt-3r07iavnk-ammtATWSlF0pSrd1DsMW7g](https://join.slack.com/t/arize-ai/shared_invite/zt-3r07iavnk-ammtATWSlF0pSrd1DsMW7g 'Slack Community' target=\"_blank\")\n*   **PyPI Package**: [https://pypi.org/project/arize-phoenix/](https://pypi.org/project/arize-phoenix/ 'PyPI Package' target=\"_blank\")","metrics":{"detailViews":2,"githubClicks":1},"dates":{"published":null,"modified":"2026-06-28T07:36:02.000Z"}}