# Airweave: Context Retrieval for AI Agents Across Apps and Databases

This repository profile is provided by osrepos.com, an open source repository discovery platform.

Source: osrepos.com
Repository profile: https://osrepos.com/repo/airweave-ai-airweave
Generated for open source discovery and AI-assisted research.

Airweave is an open-source context retrieval layer designed for AI agents, enabling them to access information across various applications and databases. It transforms diverse content into searchable knowledge bases, offering a standardized interface for agents to perform semantic, hybrid, and recency-biased searches. The platform simplifies data synchronization, entity extraction, and serves as a robust foundation for building intelligent AI applications.

GitHub: https://github.com/airweave-ai/airweave
OSRepos URL: https://osrepos.com/repo/airweave-ai-airweave

## Summary

Airweave is an open-source context retrieval layer designed for AI agents, enabling them to access information across various applications and databases. It transforms diverse content into searchable knowledge bases, offering a standardized interface for agents to perform semantic, hybrid, and recency-biased searches. The platform simplifies data synchronization, entity extraction, and serves as a robust foundation for building intelligent AI applications.

## Topics

- AI Agents
- Context Retrieval
- LLM
- RAG
- Vector Database
- Knowledge Graph
- Python
- Data Synchronization

## Repository Information

Last analyzed by OSRepos: Fri Dec 19 2025 12:01:18 GMT+0000 (Western European Standard Time)
Detail views: 4
GitHub clicks: 3

## Safety Notice

OSRepos shares public repositories for knowledge and discovery only. Review source code, dependencies, licenses, and security implications before running or installing anything.

## Content

## Introduction

Airweave is a fully open-source context retrieval layer designed to empower AI agents by providing them with access to information across various applications and databases. It connects seamlessly to productivity tools, databases, and document stores, transforming their diverse contents into semantically searchable knowledge bases. This platform offers a standardized interface, accessible via REST API or MCP, handling everything from authentication and data extraction to embedding and serving, making it easier for agents to find relevant information.

## Installation

You can get started with Airweave quickly by self-hosting it using Docker and Docker Compose.

First, ensure Docker and Docker Compose are installed on your system.

bash
# 1. Clone the repository
git clone https://github.com/airweave-ai/airweave.git
cd airweave

# 2. Build and run
chmod +x start.sh
./start.sh


Once the setup is complete, you can access the Airweave dashboard at `http://localhost:8080`. For a managed service option, consider [Airweave Cloud](https://app.airweave.ai/).

## Examples

Airweave provides SDKs for both Python and TypeScript/JavaScript, allowing developers to easily integrate its powerful search capabilities into their applications.

### Python SDK

Install the SDK:

bash
pip install airweave-sdk


Example usage:

python
from airweave import AirweaveSDK

# Initialize client
client = AirweaveSDK(
    api_key="YOUR_API_KEY",
    base_url="http://localhost:8001"
)

# Create a collection
collection = client.collections.create(name="My Collection")

# Add a source connection (e.g., Stripe)
source = client.source_connections.create(
    name="My Stripe Connection",
    short_name="stripe",
    readable_collection_id=collection.readable_id,
    authentication={
        "credentials": {"api_key": "your_stripe_api_key"}
    }
)

# Semantic search (default)
results = client.collections.search(
    readable_id=collection.readable_id,
    query="Find recent failed payments"
)

# Hybrid search (semantic + keyword)
results = client.collections.search(
    readable_id=collection.readable_id,
    query="customer invoices Q4 2024",
    search_type="hybrid"
)

# Get AI-generated answer instead of raw results
answer = client.collections.search(
    readable_id=collection.readable_id,
    query="What are our customer refund policies?",
    response_type="completion",
    enable_reranking=True
)


### TypeScript/JavaScript SDK

Install the SDK:

bash
npm install @airweave/sdk
# or
yarn add @airweave/sdk


Example usage:

typescript
import { AirweaveSDKClient, AirweaveSDKEnvironment } from "@airweave/sdk";

// Initialize client
const client = new AirweaveSDKClient({
    apiKey: "YOUR_API_KEY",
    environment: AirweaveSDKEnvironment.Local
});

// Create a collection
const collection = await client.collections.create({
    name: "My Collection"
});

// Add a source connection (e.g., Stripe)
const source = await client.sourceConnections.create({
    name: "My Stripe Connection",
    shortName: "stripe",
    readableCollectionId: collection.readableId,
    authentication: {
        credentials: { apiKey: "your_stripe_api_key" }
    }
});

// Semantic search (default)
const results = await client.collections.search(
    collection.readableId,
    { query: "Find recent failed payments" }
);

// Hybrid search (semantic + keyword)
const hybridResults = await client.collections.search(
    collection.readableId,
    {
        query: "customer invoices Q4 2024",
        searchType: "hybrid"
    }
);

// Get AI-generated answer instead of raw results
const answer = await client.collections.search(
    collection.readableId,
    {
        query: "What are our customer refund policies?",
        responseType: "completion",
        enableReranking: true
    }
);


## Why Use Airweave

Airweave stands out as a powerful solution for enhancing AI agent capabilities through robust context retrieval. Its key features include:

*   **Extensive Data Synchronization**: Connects to over 30 sources, including popular apps like GitHub, Notion, Slack, Salesforce, and various databases, with minimal configuration.
*   **Intelligent Data Processing**: Features entity extraction and a sophisticated transformation pipeline to prepare data for optimal search.
*   **Scalable Architecture**: Built with a multi-tenant architecture and OAuth2 support, ensuring security and scalability.
*   **Efficient Updates**: Utilizes content hashing for incremental updates, keeping knowledge bases fresh without redundant processing.
*   **Advanced Search Capabilities**: Offers semantic search for nuanced agent queries, alongside hybrid search (semantic + keyword), query expansion, reranking, and recency bias options.
*   **Data Versioning**: Provides versioning for data changes, ensuring traceability and reliability.

With its comprehensive feature set and support for a wide array of integrations, Airweave simplifies the creation of intelligent AI applications that can effectively leverage vast amounts of distributed information.

## Links

*   **GitHub Repository**: [airweave-ai/airweave](https://github.com/airweave-ai/airweave)
*   **Official Documentation**: [Airweave Docs](https://docs.airweave.ai/welcome)
*   **Discord Community**: [Join Discord](https://discord.com/invite/484HY9Ehxt)
*   **Twitter**: [Follow @airweave_ai](https://x.com/airweave_ai)