ROMA: Recursive Open Meta-Agents for High-Performance Multi-Agent Systems

This repository profile is provided by osrepos.com, an open source repository discovery platform.

ROMA: Recursive Open Meta-Agents for High-Performance Multi-Agent Systems

Summary

ROMA is a powerful meta-agent framework designed for building high-performance multi-agent systems using recursive hierarchical structures. It simplifies complex problem-solving by breaking tasks into parallelizable components, offering transparent development and proven performance. This open-source framework is extensible, allowing developers to customize agents and benefit from community-driven improvements.

Repository Information

Analyzed by OSRepos on November 23, 2025

Use at your own risk

OSRepos shares public repositories for knowledge and discovery only. Any installation, execution, configuration, or use of code from these repositories is the user's own responsibility. Always review the repository, source code, dependencies, licenses, and security implications before running or installing anything. OSRepos is not responsible for issues, damages, or losses resulting from third-party repositories.

Introduction

ROMA (Recursive Open Meta-Agents) is a powerful meta-agent framework designed to build high-performance multi-agent systems. It leverages recursive hierarchical structures to effectively solve complex problems by breaking them down into parallelizable components. This approach enables agents to tackle sophisticated reasoning challenges while maintaining transparency, making context engineering and iteration straightforward.

The core of ROMA operates on a recursive plan-execute loop:

  • Atomizer: Decides if a request is atomic (directly executable) or requires planning.
  • Planner: If planning is needed, the task is broken into smaller subtasks, which are then fed back into the Atomizer, creating a recursive process.
  • Executors: Handle atomic tasks, utilizing LLMs, APIs, or other agents.
  • Aggregator: Collects and integrates results from subtasks to produce the final answer for the parent task.
  • Verifier (Optional): Inspects the aggregated output against the original goal before delivery.

ROMA offers parallel problem-solving, transparent development with a clear structure for easy debugging, and proven performance, as demonstrated through its search agent's strong benchmark results. As an open-source and extensible platform, ROMA is built for community-driven development, allowing you to customize agents for specific needs.

Installation

The recommended way to set up ROMA for production-ready features, including persistence, API, and observability, is using Docker.

Prerequisites:

  • Docker & Docker Compose
  • Python 3.12+ (for local development)
  • Just command runner (optional, recommended)

Complete Setup with Docker:
This single command builds Docker images, starts all services (PostgreSQL, MinIO, REST API), configures the environment, and optionally sets up S3 storage or E2B code execution.

just setup

You can verify services are running by checking curl http://localhost:8000/health.
Required environment variables for LLM providers (e.g., OPENROUTER_API_KEY, OPENAI_API_KEY) are typically configured automatically by just setup or can be set manually.

Examples

ROMA's modular design allows for flexible orchestration. Here's a quick example mirroring a typical end-to-end workflow, showcasing how different modules can work with distinct models and strategies:

import dspy
from roma_dspy import Aggregator, Atomizer, Executor, Planner, Verifier, SubTask
from roma_dspy.types import TaskType

# Optional tool that the Executor may call
def get_weather(city: str) -> str:
    """Return a canned weather report for the city."""
    return f"The weather in {city} is sunny."

# Executor geared toward ReAct with a Fireworks model
executor_lm = dspy.LM(
    "fireworks_ai/accounts/fireworks/models/kimi-k2-instruct-0905",
    temperature=0.7,
    cache=True,
)
executor = Executor(
    lm=executor_lm,
    prediction_strategy="react",
    tools=[get_weather],
    context_defaults={"track_usage": True},
)

# Atomizer decides when to branch into planning
atomizer = Atomizer(
    lm=dspy.LM("openrouter/google/gemini-2.5-flash", temperature=0.6, cache=False),
    prediction_strategy="cot",
    context_defaults={"track_usage": True},
)

# Planner produces executable subtasks for non-atomic goals
planner = Planner(
    lm=dspy.LM("openrouter/openai/gpt-4o-mini", temperature=0.85, cache=True),
    prediction_strategy="cot",
    context_defaults={"track_usage": True},
)

aggregator = Aggregator(
    lm=dspy.LM("openrouter/openai/gpt-4o-mini", temperature=0.65),
    prediction_strategy="cot",
)

verifier = Verifier(
    lm=dspy.LM("openrouter/openai/gpt-4o-mini", temperature=0.0),
)

def run_pipeline(goal: str) -> str:
    atomized = atomizer.forward(goal)
    if atomized.is_atomic or atomized.node_type.is_execute:
        execution = executor.forward(goal)
        candidate = execution.output
    else:
        plan = planner.forward(goal)
        results = []
        for idx, subtask in enumerate(plan.subtasks, start=1):
            execution = executor.forward(subtask.goal)
            results.append(
                SubTask(
                    goal=subtask.goal,
                    task_type=subtask.task_type,
                    dependencies=subtask.dependencies,
                )
            )
        aggregated = aggregator.forward(goal, results)
        candidate = aggregated.synthesized_result

    verdict = verifier.forward(goal, candidate)
    if verdict.verdict:
        return candidate
    return f"Verifier flagged the output: {verdict.feedback or 'no feedback returned'}"

print(run_pipeline("Plan a weekend in Barcelona and include a packing list."))

This example demonstrates how ROMA's modules, each potentially using different Language Models and prediction strategies, can be orchestrated to achieve complex goals.

Why Use ROMA?

ROMA stands out as a robust framework for building advanced AI agents due to several key advantages:

  • Parallel Problem Solving: ROMA's recursive decomposition allows agents to work on different parts of complex tasks simultaneously, significantly improving efficiency and throughput.
  • Transparent Development: The clear, hierarchical structure of ROMA makes it easy to understand, debug, and iterate on agent behaviors, simplifying context engineering.
  • Proven Performance: The framework has demonstrated strong benchmark results, particularly with its search agent, across challenging datasets like SEAL-0, FRAMES, and SimpleQA, showcasing its effectiveness in fact-seeking and reasoning tasks.
  • Open-Source and Extensible: ROMA is designed for community-driven development, providing a flexible and customizable platform where developers can build and adapt agents to their specific needs, benefiting from collective improvements.
  • DSPy Integration: Built upon DSPy, ROMA leverages a declarative framework for prompting, planning, and tool integration, ensuring stable interfaces and powerful reasoning capabilities.
  • Comprehensive Toolkits: It includes 9 built-in toolkits for various functionalities, from file operations and code execution to crypto data and web search, making agents highly versatile.

Links

Related repositories

Similar repositories that may be relevant next.

LangTest: A Comprehensive Library for Safe & Effective Language Models

LangTest: A Comprehensive Library for Safe & Effective Language Models

June 30, 2026

LangTest is an open-source Python library dedicated to ensuring the safety and effectiveness of language models. It offers a comprehensive framework for testing model quality, covering robustness, bias, fairness, and accuracy across various NLP tasks and LLM providers. With LangTest, developers can generate and execute over 60 distinct test types with just one line of code, promoting responsible AI development.

ai-safetyai-testinglarge-language-models
EvalPlus: Rigorous Evaluation for LLM-Synthesized Code

EvalPlus: Rigorous Evaluation for LLM-Synthesized Code

June 30, 2026

EvalPlus is a robust framework designed for the rigorous evaluation of code generated by Large Language Models (LLMs). It extends standard benchmarks like HumanEval and MBPP with significantly more tests, offering precise assessment of code correctness and efficiency. This tool is crucial for developers and researchers aiming to thoroughly validate LLM-synthesized code.

benchmarklarge-language-modelsprogram-synthesis
AgentEvals: Robust Evaluation Tools for LLM Agent Trajectories

AgentEvals: Robust Evaluation Tools for LLM Agent Trajectories

June 30, 2026

AgentEvals is a powerful open-source package from LangChain designed to simplify the evaluation of agentic applications. It provides a collection of ready-made evaluators and utilities, with a particular focus on analyzing agent trajectories, the intermediate steps an agent takes to solve problems. This helps developers understand and improve the reliability and performance of their LLM agents.

PythonLLMAgents
Phoenix: AI Observability and Evaluation Platform for LLMs

Phoenix: AI Observability and Evaluation Platform for LLMs

June 28, 2026

Phoenix is an open-source AI observability platform from Arize AI, designed for comprehensive experimentation, evaluation, and troubleshooting of LLM applications. It provides robust features including OpenTelemetry-based tracing, LLM evaluation, and systematic prompt management. This platform helps developers optimize and debug their AI models effectively across various environments.

AI ObservabilityLLM EvaluationPrompt Engineering

Source repository

Open the original repository on GitHub.

View on GitHub
OS
OSRepos

Analysis and discovery of open source repositories. Find interesting projects and follow their updates.

Monitor your website with YourWebsiteScore

OSRepos shares public repositories for knowledge and discovery only. Any installation, execution, configuration, or use of third-party repository code is at your own risk. Always review source code, dependencies, licenses, and security implications before running anything.

© 2025 OSRepos. Built with Nuxt 3 and lots of ❤️