# Phoenix: AI Observability and Evaluation Platform for LLMs

This repository profile is provided by osrepos.com, an open source repository discovery platform.

Source: osrepos.com
Repository profile: https://osrepos.com/repo/arize-ai-phoenix
Generated for open source discovery and AI-assisted research.

Phoenix is an open-source AI observability platform from Arize AI, designed for comprehensive experimentation, evaluation, and troubleshooting of LLM applications. It provides robust features including OpenTelemetry-based tracing, LLM evaluation, and systematic prompt management. This platform helps developers optimize and debug their AI models effectively across various environments.

GitHub: https://github.com/Arize-ai/phoenix
OSRepos URL: https://osrepos.com/repo/arize-ai-phoenix

## Summary

Phoenix is an open-source AI observability platform from Arize AI, designed for comprehensive experimentation, evaluation, and troubleshooting of LLM applications. It provides robust features including OpenTelemetry-based tracing, LLM evaluation, and systematic prompt management. This platform helps developers optimize and debug their AI models effectively across various environments.

## Topics

- AI Observability
- LLM Evaluation
- Prompt Engineering
- AI Monitoring
- LangChain
- LlamaIndex
- Python
- AI Agents

## Repository Information

Last analyzed by OSRepos: Sun Jun 28 2026 08:36:02 GMT+0100 (Western European Summer Time)
Detail views: 2
GitHub clicks: 1

## Safety Notice

OSRepos shares public repositories for knowledge and discovery only. Review source code, dependencies, licenses, and security implications before running or installing anything.

## Content

## Introduction

Phoenix is an open-source AI observability platform built by Arize AI, specifically engineered for the experimentation, evaluation, and troubleshooting of large language model (LLM) applications. It offers a comprehensive suite of tools to gain deep insights into your AI systems.

Key capabilities include:

*   **Tracing**: Instrument your LLM application's runtime using OpenTelemetry for detailed visibility.
*   **Evaluation**: Benchmark application performance with LLM-powered response and retrieval evaluations.
*   **Datasets**: Create versioned datasets for experimentation, evaluation, and fine-tuning.
*   **Experiments**: Track and evaluate changes to prompts, LLMs, and retrieval strategies.
*   **Playground**: Optimize prompts, compare models, adjust parameters, and replay traced LLM calls.
*   **Prompt Management**: Systematically manage and test prompt changes with version control and tagging.
*   **PXI (Phoenix Intelligence)**: An integrated AI engineering agent for debugging traces and iterating on prompts.

Phoenix is designed to be vendor and language agnostic, providing out-of-the-box support for popular frameworks like OpenAI Agents SDK, LangChain, LlamaIndex, and DSPy, as well as LLM providers such as OpenAI, Anthropic, and Google GenAI. It runs flexibly on your local machine, in a Jupyter notebook, as a containerized deployment, or in the cloud.

## Installation

Getting started with Phoenix is straightforward. You can install the core package using `pip` or `conda`:

shell
pip install arize-phoenix


For containerized deployments, Phoenix container images are available via [Docker Hub](https://hub.docker.com/r/arizephoenix/phoenix 'Docker Hub' target="_blank") and can be deployed using Docker or Kubernetes. Arize AI also offers cloud instances at [app.phoenix.arize.com](https://app.phoenix.arize.com/ 'Arize AI Cloud' target="_blank").

## Examples

Phoenix offers extensive integration examples across various LLM frameworks and providers, demonstrating its versatility. It supports popular Python frameworks such as OpenAI Agents SDK, LlamaIndex, LangChain, and DSPy, enabling seamless tracing and evaluation within these ecosystems. For JavaScript developers, integrations include the OpenAI Node SDK, LangChain.js, and Vercel AI SDK, ensuring broad compatibility.

Beyond the main platform, Phoenix provides specialized lightweight Python sub-packages like `arize-phoenix-otel` for OpenTelemetry wrappers, `arize-phoenix-client` for API interaction, and `arize-phoenix-evals` for LLM evaluation tooling. Similar TypeScript sub-packages are also available. Additionally, the repository includes coding agent skills for platforms like Claude Code and Cursor, facilitating advanced debugging and evaluation workflows directly within your coding environment.

## Why use Phoenix

Phoenix stands out as a critical tool for anyone developing and deploying LLM applications. Its comprehensive AI observability features, including tracing, evaluation, and dataset management, provide unparalleled visibility into your models' behavior. By being vendor and language agnostic and built on OpenTelemetry, Phoenix ensures maximum flexibility and integration with your existing tech stack. The platform's ability to track and evaluate prompt and model changes through experiments, coupled with its intuitive playground, empowers developers to iterate and optimize their AI systems efficiently. With flexible deployment options and a strong community, Phoenix is an essential asset for debugging, improving, and maintaining robust LLM applications.

## Links

*   **GitHub Repository**: [https://github.com/Arize-ai/phoenix](https://github.com/Arize-ai/phoenix 'GitHub Repository' target="_blank")
*   **Documentation**: [https://arize.com/docs/phoenix/](https://arize.com/docs/phoenix/ 'Phoenix Documentation' target="_blank")
*   **Slack Community**: [https://join.slack.com/t/arize-ai/shared_invite/zt-3r07iavnk-ammtATWSlF0pSrd1DsMW7g](https://join.slack.com/t/arize-ai/shared_invite/zt-3r07iavnk-ammtATWSlF0pSrd1DsMW7g 'Slack Community' target="_blank")
*   **PyPI Package**: [https://pypi.org/project/arize-phoenix/](https://pypi.org/project/arize-phoenix/ 'PyPI Package' target="_blank")