Repository History
187 repositories tagged with AI
VGGT: Visual Geometry Grounded Transformer for Rapid 3D Scene Reconstruction
VGGT, which received the CVPR 2025 Best Paper Award, is a Visual Geometry Grounded Transformer developed by Meta AI and the Visual Geometry Group at Oxford. This feed-forward neural network infers key 3D scene attributes, including camera parameters, depth maps, and 3D point tracks, from one or more images within seconds, which makes it well suited to rapid 3D reconstruction and scene understanding.
ClickUi: Your Cross-Platform AI Assistant for Local and Cloud Models
ClickUi is a powerful, open-source AI assistant built entirely in Python, designed for cross-platform use. It seamlessly integrates various AI models, speech recognition, and web scraping capabilities, offering both voice and text interaction modes. This tool allows users to leverage local or paid API models, providing a comprehensive and customizable AI experience directly on their computer.

Model Context Protocol TypeScript SDK: Build MCP Servers and Clients
The `modelcontextprotocol/typescript-sdk` is the official TypeScript SDK for interacting with Model Context Protocol (MCP) servers and clients. It provides a standardized way for applications to offer context to Large Language Models (LLMs), separating context provision from LLM interaction. Developers can use it to easily create MCP servers that expose resources, prompts, and tools, as well as build MCP clients to connect to any MCP server.

Aider: AI Pair Programming in Your Terminal
Aider is an open-source project that brings AI pair programming directly to your terminal, enabling developers to collaborate with large language models (LLMs). It helps in building new projects or enhancing existing codebases efficiently. With robust features like codebase mapping, Git integration, and multi-language support, Aider is a versatile tool for modern development workflows.

WordPress MCP: AI Integration Plugin (Migrate to mcp-adapter)
WordPress MCP is a comprehensive plugin designed to integrate WordPress functionality with AI models using the Model Context Protocol (MCP). It enables secure interaction between AI applications and WordPress sites through standardized interfaces and dual transport protocols. This repository is now deprecated, and users are strongly encouraged to migrate to the new mcp-adapter for ongoing development and support, aligning with the Abilities API moving into WordPress Core.
LAM: Large Avatar Model for One-shot Animatable Gaussian Head
LAM is a cutting-edge project that enables the creation of ultra-realistic, animatable 3D avatars from just a single image in seconds. Leveraging advanced Gaussian Head technology, it offers super-fast cross-platform rendering and a low-latency SDK for real-time interactive chatting. This innovative model is set to be presented at SIGGRAPH 2025.

Trae Agent: An LLM-Based Agent for General Software Engineering Tasks
Trae Agent is an LLM-based agent designed for general-purpose software engineering tasks, offering a powerful CLI that understands natural language instructions. It enables complex software engineering workflows using various tools and LLM providers, featuring a transparent, modular, and research-friendly architecture. This project is ideal for studying AI agent architectures and developing novel agent capabilities.
Open NotebookLM: Convert PDFs into Personalized Podcast Episodes
Open NotebookLM is an innovative open-source project that transforms any PDF document into an engaging podcast episode. Inspired by NotebookLM, it leverages powerful LLMs and text-to-speech models to generate natural dialogue from your documents. This tool provides a unique way to consume information, making learning and content absorption more accessible and enjoyable.
logocreator: Free & Open-Source AI Logo Generator by Nutlope
logocreator is an open-source logo generator that leverages Flux Pro 1.1 on Together AI to create professional logos quickly. Built with a modern tech stack including Next.js and TypeScript, it offers customizable styles for generating unique designs. This project is ideal for anyone looking to generate high-quality logos with the power of artificial intelligence.

KAG: Knowledge Augmented Generation for LLM Reasoning in Professional Domains
KAG is a logical-form-guided reasoning and retrieval framework built upon the OpenSPG engine and Large Language Models. It is designed for building logical reasoning and factual Q&A solutions over specialized domain knowledge bases. The framework aims to overcome the limitations of vector-similarity retrieval in traditional RAG and the noise often introduced by GraphRAG approaches.

PyPriompt: Python Library for Priority-Based Prompt Design
PyPriompt is a Python library for designing prompts, inspired by web design libraries like React and FastHTML. It intelligently manages context windows by using priorities to decide what information to include in the prompt, ensuring efficient use of token limits. This tool helps developers create dynamic and adaptable prompts for large language models.
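The priority idea can be illustrated with a small standalone sketch. It does not use PyPriompt's actual API: the `Part` class, the `render` function, and the word-count tokenizer are all hypothetical names chosen for illustration. It assumes each prompt part carries a numeric priority and that the lowest-priority parts are dropped first until the prompt fits the token limit.

```python
from dataclasses import dataclass


@dataclass
class Part:
    """One piece of a prompt (hypothetical type, not PyPriompt's)."""
    text: str
    priority: int  # higher means more important


def count_tokens(text: str) -> int:
    # Crude stand-in for a real tokenizer: one token per whitespace-separated word.
    return len(text.split())


def render(parts: list[Part], token_limit: int) -> str:
    """Keep every part at or above the lowest priority cutoff that fits the limit."""
    # Try cutoffs from lowest to highest; the first one that fits keeps the most parts.
    for cutoff in sorted({p.priority for p in parts}):
        kept = [p for p in parts if p.priority >= cutoff]
        if sum(count_tokens(p.text) for p in kept) <= token_limit:
            # Surviving parts stay in their original order.
            return "\n".join(p.text for p in kept)
    return ""  # nothing fits, not even the highest-priority parts


parts = [
    Part("You are a helpful assistant.", priority=100),
    Part("Earlier conversation summary goes here.", priority=10),
    Part("User question: what is MCP?", priority=90),
]
prompt = render(parts, token_limit=10)
print(prompt)
```

With a limit of 10 tokens, the low-priority summary is dropped while the system line and the user question are kept in their original order. A real library would use the model's tokenizer and a richer component tree, so treat this only as a sketch of the selection step.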

GLM-4.5: Agentic, Reasoning, and Coding Foundation Models for Advanced AI
The GLM-4.5 GitHub repository introduces the GLM-4.5 and GLM-4.6 series of foundation models, designed for advanced agentic, reasoning, and coding capabilities. These models offer significant improvements, including longer context windows, enhanced coding performance, and superior reasoning, making them highly competitive in the LLM landscape. Developers can leverage these models for complex intelligent agent applications, backed by strong benchmark results.