Repository History
187 repositories tagged with AI
CORE: A Unified Memory System for Your AI Applications
CORE by RedPlanetHQ is an open-source project designed to provide a persistent, unified memory layer for AI applications. It leverages a temporal knowledge graph to prevent context loss across various AI tools, ensuring LLMs retain past conversations, preferences, and project history. This system significantly enhances AI interactions by making context available across different sessions and platforms.

ROMA: Recursive Open Meta-Agents for High-Performance Multi-Agent Systems
ROMA is a powerful meta-agent framework designed for building high-performance multi-agent systems using recursive hierarchical structures. It simplifies complex problem-solving by breaking tasks into parallelizable components, offering transparent development and proven performance. This open-source framework is extensible, allowing developers to customize agents and benefit from community-driven improvements.
txtinstruct: Building Instruction-Tuned Models with Custom Data
txtinstruct is a Python framework designed for training instruction-tuned models. It focuses on supporting open data and models, enabling users to build their own instruction-following datasets and train models without licensing ambiguity. This project simplifies the process of creating custom instruction-tuned solutions.
audio2photoreal: Synthesizing Photorealistic Codec Avatars from Audio
audio2photoreal is a powerful GitHub repository from Facebook Research that provides code and a dataset for generating photorealistic Codec Avatars driven solely from audio input. This project enables the synthesis of human embodiment in conversations, offering tools for training, testing, and running pretrained models to create lifelike digital representations. It represents a significant advancement in AI-driven computer graphics and virtual reality.

mlx-examples: Practical Examples for the MLX Machine Learning Framework
mlx-examples is a comprehensive GitHub repository showcasing a variety of standalone examples built using the MLX framework. It provides practical implementations across text, image, audio, and multimodal models, serving as an excellent resource for developers exploring MLX. This collection helps users understand and apply MLX for diverse machine learning tasks.

Open R1: An Open-Source Reproduction of DeepSeek-R1 for Advanced LLM Training
Open R1 is a Hugging Face project dedicated to creating a fully open reproduction of DeepSeek-R1, a powerful reasoning language model. This initiative provides comprehensive tools and recipes for training, evaluating, and generating data for large language models. It fosters community collaboration in AI research, enabling developers to build upon and understand the complex R1 pipeline.

Qwen Code: An AI-Powered Command-Line Workflow Tool for Developers
Qwen Code is an advanced command-line AI workflow tool designed to enhance developer productivity. Adapted from Gemini CLI and optimized for Qwen3-Coder models, it offers intelligent assistance for coding, understanding large codebases, and automating development tasks. This tool provides powerful features to streamline your daily coding activities, including free usage options.

TOON: Compact, Human-Readable JSON for LLM Prompts
TOON, or Token-Oriented Object Notation, is a compact and human-readable data format designed to optimize JSON serialization for Large Language Model (LLM) prompts. It significantly reduces token count while maintaining explicit structure, making data more efficient and reliable for AI applications. This format combines indentation-based structure with tabular layouts for uniform arrays, offering a powerful alternative to traditional JSON and YAML.

InfiniteTalk: Unlimited-Length AI Video Generation from Audio or Images
InfiniteTalk is an innovative AI model for generating unlimited-length talking videos. It excels at creating realistic video content from audio, supporting both image-to-video and video-to-video generation. This framework ensures accurate lip synchronization and consistent identity preservation, aligning head movements, body posture, and facial expressions with the input audio.

llama-cpp-python: Python Bindings for llama.cpp
llama-cpp-python provides robust Python bindings for the popular llama.cpp library, enabling efficient local inference with large language models. It offers a high-level API compatible with OpenAI's API, facilitating easy integration into existing applications. The project also includes a powerful web server for local deployment and supports various hardware acceleration backends.

Podcastfy: Transform Multimodal Content into AI-Generated Multilingual Podcasts
Podcastfy is an open-source Python package that transforms diverse multimodal content, such as text, images, and videos, into engaging multilingual audio conversations. Utilizing generative AI, it offers a flexible and programmatic alternative to tools like NotebookLM, focusing on customization and scalability. This makes it an excellent solution for content creators, educators, and researchers aiming to broaden their audience reach and improve content accessibility.

Marker: High-Accuracy Document Conversion to Markdown and JSON
Marker is an open-source Python tool designed for high-accuracy conversion of documents like PDFs, images, and office files into Markdown, JSON, and HTML. It excels at preserving complex formatting, extracting images, and can leverage LLMs for even greater precision. This makes Marker a powerful solution for structured document intelligence.