Repository History
10 repositories tagged with rag

OpenDataLoader PDF: AI-Ready Data Extraction and Accessibility Automation
OpenDataLoader PDF is an open-source tool for extracting AI-ready data from PDFs and automating PDF accessibility. It outputs structured Markdown, JSON with bounding boxes, and HTML, and the project reports ranking first in extraction accuracy benchmarks. The library also offers end-to-end auto-tagging that produces screen-reader-ready Tagged PDFs for accessibility compliance.

claude-mem: Persistent Context Across Sessions for AI Agents
claude-mem is a GitHub repository that provides persistent context across sessions for AI agents. It captures agent activity, compresses it using AI, and injects relevant information into future sessions. It supports several AI platforms, including Claude Code, OpenClaw, Gemini, and Copilot.

Gurubase: AI-Powered Q&A Assistant Issue Tracker
Gurubase is a platform designed to transform your content into a 24/7 AI support assistant. This GitHub repository serves as the official hub for managing issues, feature requests, and bug reports related to the Gurubase product. It provides a centralized place for the community to contribute to the product's improvement.

Memary: The Open Source Memory Layer for Autonomous Agents
Memary is an open-source memory layer that enhances autonomous agents by emulating human memory. It integrates knowledge graphs and memory modules to support agent reasoning and learning. The project aims to make agents more intelligent and capable of self-improvement.

GraphRAG: A Modular Graph-Based RAG System for LLM Discovery
GraphRAG, developed by Microsoft, is a modular graph-based Retrieval-Augmented Generation (RAG) system. It uses Large Language Models (LLMs) to extract meaningful, structured data from unstructured text. The system enhances an LLM's ability to reason about private and narrative data by leveraging knowledge graph memory structures.

kotaemon: An Open-Source RAG Tool for Document Chat
kotaemon is an open-source, RAG-based tool designed to facilitate interactive conversations with your documents. It provides a clean and customizable UI, catering to both end-users seeking document Q&A and developers building RAG pipelines.

RAG Web UI: An Intelligent Dialogue System with Retrieval-Augmented Generation
RAG Web UI is an intelligent dialogue system leveraging Retrieval-Augmented Generation (RAG) technology to build robust Q&A systems. It enables users to create knowledge bases from various document formats and supports multiple LLM deployment options, including cloud services and local models like Ollama. The system also offers OpenAPI interfaces for seamless integration.

MaxKB: Open-Source Platform for Enterprise-Grade AI Agents
MaxKB is a powerful, open-source platform designed for building enterprise-grade AI agents. It features integrated Retrieval-Augmented Generation (RAG) pipelines, robust workflow orchestration, and advanced MCP tool-use capabilities. This platform is ideal for intelligent customer service, corporate knowledge bases, and academic applications.

Memvid: Revolutionizing AI Memory with Video Compression and Semantic Search
Memvid is a Python library that transforms large amounts of text data into compact, searchable MP4 video files. It uses video codecs to store millions of text chunks as QR codes, enabling fast semantic search without a traditional database. This approach offers storage savings, portability, and an offline-first design for AI memory applications.

SurfSense: Open Source AI Research Agent with Extensive Integrations
SurfSense is an open-source AI research agent designed as an alternative to tools like NotebookLM and Perplexity. It integrates with a wide array of external sources, including search engines, Slack, Notion, and GitHub, allowing users to connect their personal knowledge base. This highly customizable tool offers features like powerful search, chat with saved content, cited answers, and local LLM support for enhanced privacy.