Repository History

Explore all analyzed open source repositories

Topic: Generative AI
Docling: Streamlining Document Processing for Generative AI

Docling: Streamlining Document Processing for Generative AI

Docling is a powerful Python library designed to simplify document processing and prepare diverse formats for generative AI applications. It excels at parsing various document types, including advanced PDF understanding, and offers seamless integrations with popular AI frameworks. With Docling, developers can efficiently extract, transform, and utilize document content for their AI models.

Mar 22, 2026
View Details
LogoAI: AI-Powered Logo Generator with Next.js and Nebius AI

LogoAI: AI-Powered Logo Generator with Next.js and Nebius AI

LogoAI is an innovative web application that leverages artificial intelligence to create unique and professional logos. Built with Next.js and TypeScript, and powered by Nebius AI, it offers a streamlined experience for generating custom logos for various brands. Users can explore multiple AI models, customize designs, and manage their generation history.

Feb 11, 2026
View Details
fast-agent: Build and Orchestrate Multimodal AI Agents and Workflows

fast-agent: Build and Orchestrate Multimodal AI Agents and Workflows

fast-agent is a powerful Python framework designed for creating and interacting with sophisticated multimodal AI agents and workflows. It offers a simple, declarative syntax for defining agents, comprehensive model support, and unique features like end-to-end tested MCP (Multi-modal Communication Protocol) integration. Developers can rapidly build, test, and deploy complex agent applications with advanced capabilities such as structured outputs, vision, and various orchestration patterns.

Jan 9, 2026
View Details
StreamDiffusion: Real-Time Interactive Generation with Diffusion Pipelines

StreamDiffusion: Real-Time Interactive Generation with Diffusion Pipelines

StreamDiffusion is an innovative diffusion pipeline designed for real-time interactive generation, significantly enhancing the performance of current diffusion-based image generation techniques. It offers a pipeline-level solution to achieve high-speed image and text-to-image generation, making interactive AI experiences more accessible. This project introduces several key features to optimize computational efficiency and GPU utilization.

Dec 13, 2025
View Details
GenerativeAICourse: A Comprehensive Hands-On Generative AI Engineering Course

GenerativeAICourse: A Comprehensive Hands-On Generative AI Engineering Course

This repository offers a comprehensive, hands-on Generative AI course, starting from fundamental AI concepts to building production-grade applications. It focuses on AI engineering, covering topics like LLMs, RAG, AI agents, and prompt engineering with practical tutorials. The course aims to equip learners with the skills needed to build real-world AI solutions.

Dec 4, 2025
View Details
audio2photoreal: Synthesizing Photorealistic Codec Avatars from Audio

audio2photoreal: Synthesizing Photorealistic Codec Avatars from Audio

audio2photoreal is a powerful GitHub repository from Facebook Research that provides code and a dataset for generating photorealistic Codec Avatars driven solely from audio input. This project enables the synthesis of human embodiment in conversations, offering tools for training, testing, and running pretrained models to create lifelike digital representations. It represents a significant advancement in AI-driven computer graphics and virtual reality.

Nov 20, 2025
View Details
Podcastfy: Transform Multimodal Content into AI-Generated Multilingual Podcasts

Podcastfy: Transform Multimodal Content into AI-Generated Multilingual Podcasts

Podcastfy is an open-source Python package that transforms diverse multimodal content, such as text, images, and videos, into engaging multilingual audio conversations. Utilizing generative AI, it offers a flexible and programmatic alternative to tools like NotebookLM, focusing on customization and scalability. This makes it an excellent solution for content creators, educators, and researchers aiming to broaden their audience reach and improve content accessibility.

Nov 9, 2025
View Details
Weave by Weights & Biases: A Toolkit for AI-Powered Applications

Weave by Weights & Biases: A Toolkit for AI-Powered Applications

Weave is an open-source toolkit developed by Weights & Biases designed for building and managing AI-powered applications. It provides robust features for logging, debugging, and evaluating language model inputs and outputs, streamlining the development workflow for generative AI. Weave aims to bring rigor and best practices to the experimental process of AI software development.

Nov 3, 2025
View Details
Step-Video-T2V: State-of-the-Art Text-to-Video Generation Model

Step-Video-T2V: State-of-the-Art Text-to-Video Generation Model

Step-Video-T2V is a state-of-the-art text-to-video pre-trained model capable of generating videos up to 204 frames with 30 billion parameters. It achieves high efficiency through a deep compression Video-VAE and enhances visual quality using Direct Preference Optimization (DPO). The model's performance is validated on its novel benchmark, Step-Video-T2V-Eval, demonstrating superior text-to-video quality.

Oct 29, 2025
View Details
Fluxgym: Simple FLUX LoRA Training UI with Low VRAM Support

Fluxgym: Simple FLUX LoRA Training UI with Low VRAM Support

Fluxgym offers a user-friendly web interface for training FLUX LoRA models, specifically designed to support systems with low VRAM, such as 12GB, 16GB, and 20GB GPUs. It combines the simplicity of a Gradio UI, forked from AI-Toolkit, with the powerful and flexible training capabilities of Kohya sd-scripts. This tool allows users to easily train custom LoRAs, including advanced features like automatic sample image generation and direct publishing to Hugging Face.

Oct 12, 2025
View Details
Rig: Build Modular and Scalable LLM Applications in Rust

Rig: Build Modular and Scalable LLM Applications in Rust

Rig is a powerful Rust library designed for building modular, scalable, and ergonomic LLM-powered applications. It offers extensive features, including agentic workflows, compatibility with over 20 model providers, and seamless integration with more than 10 vector stores. Developers can leverage Rig to create robust generative AI solutions with minimal boilerplate.

Oct 12, 2025
View Details
presentation-ai: AI-Powered Presentation Generator for Professional Slides

presentation-ai: AI-Powered Presentation Generator for Professional Slides

presentation-ai is an open-source, AI-powered presentation generator that serves as an alternative to tools like Gamma.app. It enables users to quickly create professional and customizable slides with AI-generated content. This tool is designed to streamline the presentation creation process, offering various themes and real-time generation capabilities.

Oct 11, 2025
View Details
Page 1