Airweave: Context Retrieval for AI Agents Across Apps and Databases

This repository profile is provided by osrepos.com, an open source repository discovery platform.

Airweave: Context Retrieval for AI Agents Across Apps and Databases

Summary

Airweave is an open-source context retrieval layer designed for AI agents, enabling them to access information across various applications and databases. It transforms diverse content into searchable knowledge bases, offering a standardized interface for agents to perform semantic, hybrid, and recency-biased searches. The platform simplifies data synchronization, entity extraction, and serves as a robust foundation for building intelligent AI applications.

Repository Information

Analyzed by OSRepos on December 19, 2025

Use at your own risk

OSRepos shares public repositories for knowledge and discovery only. Any installation, execution, configuration, or use of code from these repositories is the user's own responsibility. Always review the repository, source code, dependencies, licenses, and security implications before running or installing anything. OSRepos is not responsible for issues, damages, or losses resulting from third-party repositories.

Introduction

Airweave is a fully open-source context retrieval layer designed to empower AI agents by providing them with access to information across various applications and databases. It connects seamlessly to productivity tools, databases, and document stores, transforming their diverse contents into semantically searchable knowledge bases. This platform offers a standardized interface, accessible via REST API or MCP, handling everything from authentication and data extraction to embedding and serving, making it easier for agents to find relevant information.

Installation

You can get started with Airweave quickly by self-hosting it using Docker and Docker Compose.

First, ensure Docker and Docker Compose are installed on your system.

# 1. Clone the repository
git clone https://github.com/airweave-ai/airweave.git
cd airweave

# 2. Build and run
chmod +x start.sh
./start.sh

Once the setup is complete, you can access the Airweave dashboard at http://localhost:8080. For a managed service option, consider Airweave Cloud.

Examples

Airweave provides SDKs for both Python and TypeScript/JavaScript, allowing developers to easily integrate its powerful search capabilities into their applications.

Python SDK

Install the SDK:

pip install airweave-sdk

Example usage:

from airweave import AirweaveSDK

# Initialize client
client = AirweaveSDK(
    api_key="YOUR_API_KEY",
    base_url="http://localhost:8001"
)

# Create a collection
collection = client.collections.create(name="My Collection")

# Add a source connection (e.g., Stripe)
source = client.source_connections.create(
    name="My Stripe Connection",
    short_name="stripe",
    readable_collection_id=collection.readable_id,
    authentication={
        "credentials": {"api_key": "your_stripe_api_key"}
    }
)

# Semantic search (default)
results = client.collections.search(
    readable_id=collection.readable_id,
    query="Find recent failed payments"
)

# Hybrid search (semantic + keyword)
results = client.collections.search(
    readable_id=collection.readable_id,
    query="customer invoices Q4 2024",
    search_type="hybrid"
)

# Get AI-generated answer instead of raw results
answer = client.collections.search(
    readable_id=collection.readable_id,
    query="What are our customer refund policies?",
    response_type="completion",
    enable_reranking=True
)

TypeScript/JavaScript SDK

Install the SDK:

npm install @airweave/sdk
# or
yarn add @airweave/sdk

Example usage:

import { AirweaveSDKClient, AirweaveSDKEnvironment } from "@airweave/sdk";

// Initialize client
const client = new AirweaveSDKClient({
    apiKey: "YOUR_API_KEY",
    environment: AirweaveSDKEnvironment.Local
});

// Create a collection
const collection = await client.collections.create({
    name: "My Collection"
});

// Add a source connection (e.g., Stripe)
const source = await client.sourceConnections.create({
    name: "My Stripe Connection",
    shortName: "stripe",
    readableCollectionId: collection.readableId,
    authentication: {
        credentials: { apiKey: "your_stripe_api_key" }
    }
});

// Semantic search (default)
const results = await client.collections.search(
    collection.readableId,
    { query: "Find recent failed payments" }
);

// Hybrid search (semantic + keyword)
const hybridResults = await client.collections.search(
    collection.readableId,
    {
        query: "customer invoices Q4 2024",
        searchType: "hybrid"
    }
);

// Get AI-generated answer instead of raw results
const answer = await client.collections.search(
    collection.readableId,
    {
        query: "What are our customer refund policies?",
        responseType: "completion",
        enableReranking: true
    }
);

Why Use Airweave

Airweave stands out as a powerful solution for enhancing AI agent capabilities through robust context retrieval. Its key features include:

  • Extensive Data Synchronization: Connects to over 30 sources, including popular apps like GitHub, Notion, Slack, Salesforce, and various databases, with minimal configuration.
  • Intelligent Data Processing: Features entity extraction and a sophisticated transformation pipeline to prepare data for optimal search.
  • Scalable Architecture: Built with a multi-tenant architecture and OAuth2 support, ensuring security and scalability.
  • Efficient Updates: Utilizes content hashing for incremental updates, keeping knowledge bases fresh without redundant processing.
  • Advanced Search Capabilities: Offers semantic search for nuanced agent queries, alongside hybrid search (semantic + keyword), query expansion, reranking, and recency bias options.
  • Data Versioning: Provides versioning for data changes, ensuring traceability and reliability.

With its comprehensive feature set and support for a wide array of integrations, Airweave simplifies the creation of intelligent AI applications that can effectively leverage vast amounts of distributed information.

Links

Related repositories

Similar repositories that may be relevant next.

Phoenix: AI Observability and Evaluation Platform for LLMs

Phoenix: AI Observability and Evaluation Platform for LLMs

June 28, 2026

Phoenix is an open-source AI observability platform from Arize AI, designed for comprehensive experimentation, evaluation, and troubleshooting of LLM applications. It provides robust features including OpenTelemetry-based tracing, LLM evaluation, and systematic prompt management. This platform helps developers optimize and debug their AI models effectively across various environments.

AI ObservabilityLLM EvaluationPrompt Engineering
Odysseus: A Comprehensive Self-Hosted AI Workspace for Productivity

Odysseus: A Comprehensive Self-Hosted AI Workspace for Productivity

June 25, 2026

Odysseus is a powerful self-hosted AI workspace designed to integrate various AI-powered tools into a single platform. It offers functionalities for chat, agents, deep research, document management, email, and calendar, supporting both local and API models. This comprehensive solution aims to enhance productivity and streamline AI workflows in a private environment.

AI WorkspaceSelf-HostedPython
Loop Library: Practical Repeatable AI-Agent Workflows

Loop Library: Practical Repeatable AI-Agent Workflows

June 21, 2026

The Loop Library is a GitHub repository offering reusable AI agent workflows for various domains like engineering, content, and design. It introduces the concept of "loops," which are structured, repeatable instructions that guide AI agents through multi-step tasks. This skill enables agents to learn from results, adapt, and complete complex tasks more reliably than with one-shot prompts.

AI AgentsAgentic WorkflowsPrompt Engineering
agency-agents: Your Complete AI Agency of Specialized Experts

agency-agents: Your Complete AI Agency of Specialized Experts

May 30, 2026

agency-agents offers a comprehensive collection of over 140 meticulously crafted AI agent personalities, designed to act as specialized experts across various domains. From frontend development to marketing and sales, each agent comes with a unique voice, proven processes, and deliverable-focused outcomes. This repository provides a ready-to-deploy AI dream team to transform your workflow and accelerate project delivery.

AI AgentsShellDeveloper Tools

Source repository

Open the original repository on GitHub.

View on GitHub
OS
OSRepos

Analysis and discovery of open source repositories. Find interesting projects and follow their updates.

Monitor your website with YourWebsiteScore

OSRepos shares public repositories for knowledge and discovery only. Any installation, execution, configuration, or use of third-party repository code is at your own risk. Always review source code, dependencies, licenses, and security implications before running anything.

© 2025 OSRepos. Built with Nuxt 3 and lots of ❤️