Repository History
233 repositories tagged with Python

AuditNLG: Auditing Generative AI for Trustworthiness
AuditNLG is an open-source library from Salesforce designed to enhance the trustworthiness of generative AI language models. It provides state-of-the-art techniques to detect and improve factualness, safety, and constraint adherence in AI-generated text. This library simplifies the process of auditing AI outputs, offering explanations and alternative suggestions for problematic content.

Odysseus: A Comprehensive Self-Hosted AI Workspace for Productivity
Odysseus is a powerful self-hosted AI workspace designed to integrate various AI-powered tools into a single platform. It offers functionalities for chat, agents, deep research, document management, email, and calendar, supporting both local and API models. This comprehensive solution aims to enhance productivity and streamline AI workflows in a private environment.

Headroom: Drastically Reduce LLM Token Usage for AI Agents
Headroom is an innovative context compression layer for AI agents, designed to significantly reduce token usage for LLMs. It achieves 60-95% fewer tokens across various inputs like tool outputs, logs, files, and RAG chunks, all while preserving answer accuracy. This powerful tool enhances efficiency and cost-effectiveness for AI interactions.

spacy-llm: Integrating LLMs into Structured NLP Pipelines with spaCy
spacy-llm seamlessly integrates Large Language Models (LLMs) into spaCy, offering a modular system for rapid prototyping and transforming unstructured LLM responses into robust outputs for various NLP tasks. It supports a wide range of LLMs, including OpenAI, Cohere, Anthropic, and open-source models, enabling users to combine the power of LLMs with spaCy's production-ready capabilities. This package allows for quick experimentation and the creation of efficient, reliable, and controlled NLP systems.
PixelRAG: Pixel-Native Search for Visual Retrieval-Augmented Generation
PixelRAG revolutionizes search by enabling pixel-native retrieval, moving beyond traditional text parsing. It renders documents as screenshots, preserving visual context like tables and charts, which is crucial for accurate answers from reader models. This allows for searching any document based on its visual appearance, not just its textual content.

Tiny8: An Educational 8-bit CPU Simulator with Interactive Visualization
Tiny8 is a lightweight and educational 8-bit CPU simulator written in Python. It offers an interactive terminal debugger and graphical animation to visualize program execution, making it ideal for learning computer architecture and assembly programming.

AutoHedge: Build Your Autonomous AI Hedge Fund with Swarm Intelligence
AutoHedge is an enterprise-grade autonomous agent hedge fund that leverages swarm intelligence and specialized AI agents. This powerful Python project automates end-to-end market analysis, risk management, and trade execution. It allows users to build and deploy their own AI-driven trading strategies with minimal human intervention.

IPSpot: A Python Library for IP and Geolocation Data Retrieval
IPSpot is a powerful Python library designed to retrieve both public and private IPv4 and IPv6 addresses. It also provides detailed geolocation data, such as region, longitude, and latitude, using a reliable fallback mechanism across multiple API providers. This makes it an essential tool for network monitoring and location-based lookups directly from your system.

MOSS-TTS Family: Open-Source High-Fidelity Speech and Sound Generation
The MOSS-TTS Family offers an open-source suite of models for high-fidelity, highly expressive speech and sound generation. Designed for complex real-world scenarios, it covers stable long-form speech, multi-speaker dialogue, voice design, environmental sound effects, and real-time streaming TTS. This comprehensive family of models from MOSI.AI and OpenMOSS team provides robust solutions for diverse audio generation needs.

autoresearch: AI Agents for Autonomous LLM Training Research
autoresearch, by Andrej Karpathy, pioneers autonomous AI research by enabling agents to experiment with LLM training on a single GPU. The system allows an AI agent to modify code, train a model for a fixed 5-minute duration, and iteratively optimize for improved performance. This innovative approach aims to automate the experimental cycle of AI research, fostering continuous discovery and optimization.
MiroFish: A Universal Swarm Intelligence Engine for Predicting Anything
MiroFish is a cutting-edge AI prediction engine that leverages multi-agent technology to simulate future outcomes. It constructs high-fidelity digital worlds where intelligent agents interact, allowing users to test scenarios and deduce future trajectories. This innovative platform enables predictions across various domains, from public opinion to financial markets and even creative narrative endings.

GLM-OCR: Accurate, Fast, and Comprehensive Multimodal OCR Model
GLM-OCR is a powerful multimodal OCR model designed for complex document understanding, built on the GLM-V encoder-decoder architecture. It achieves state-of-the-art performance across various benchmarks, offering efficient inference and easy integration. This open-source solution is optimized for real-world business scenarios, providing robust and high-quality OCR capabilities.