Repository History

Explore all analyzed open source repositories

Topic: Machine Learning
index-tts-lora: High-Quality Speech Synthesis with LoRA Fine-tuning

index-tts-lora: High-Quality Speech Synthesis with LoRA Fine-tuning

index-tts-lora offers a robust solution for high-quality speech synthesis, leveraging LoRA fine-tuning on the index-tts framework. It significantly enhances prosody and naturalness for both single and multi-speaker voices. This project provides practical methods for training and inference, making advanced voice synthesis more accessible.

Mar 23, 2026
View Details
Infinity: High-Throughput, Low-Latency Serving for Text Embeddings and Reranking

Infinity: High-Throughput, Low-Latency Serving for Text Embeddings and Reranking

Infinity is a powerful, high-throughput, and low-latency REST API designed for serving various AI models, including text embeddings, reranking, and multi-modal models. It supports deploying any model from HuggingFace with fast inference backends optimized for diverse accelerators. This engine simplifies the deployment and usage of advanced AI models for developers.

Mar 17, 2026
View Details
LLMBox: A Comprehensive Python Library for LLM Training and Evaluation

LLMBox: A Comprehensive Python Library for LLM Training and Evaluation

LLMBox is a comprehensive Python library designed for implementing Large Language Models, offering a unified training pipeline and extensive model evaluation capabilities. It provides a one-stop solution for both training and utilizing LLMs, emphasizing flexibility and efficiency. Developers can leverage its diverse training strategies and blazingly fast inference for their LLM projects.

Mar 16, 2026
View Details
Rio: Build Web and Desktop Apps in Pure Python, No JavaScript Needed

Rio: Build Web and Desktop Apps in Pure Python, No JavaScript Needed

Rio is an innovative Python framework that allows developers to create web and desktop applications using pure Python, eliminating the need for HTML, CSS, or JavaScript. It provides a modern, declarative UI approach with over 50 built-in components, making app development efficient and enjoyable. With Rio, you can build powerful, type-safe applications that run seamlessly across different environments.

Mar 9, 2026
View Details
Magenta RT: Live Music Generation on Your Local Device

Magenta RT: Live Music Generation on Your Local Device

Magenta RealTime (Magenta RT) is an open-source Python library for live music audio generation on local devices. It allows users to create music using both text and audio prompts, serving as a powerful tool for real-time creative audio exploration. This library is the on-device companion to Google's MusicFX DJ Mode and the Lyria RealTime API.

Mar 6, 2026
View Details
NUDGE: Lightweight Non-Parametric Embedding Fine-Tuning for Retrieval

NUDGE: Lightweight Non-Parametric Embedding Fine-Tuning for Retrieval

NUDGE is a lightweight, non-parametric tool designed to fine-tune pre-trained embeddings, significantly enhancing retrieval and RAG pipelines. It operates by adjusting data embeddings directly, rather than modifying model parameters, to maximize accuracy. This approach often leads to over 10% improvement in retrieval accuracy and runs in minutes.

Mar 4, 2026
View Details
maestro: Streamlining Fine-Tuning for Multimodal Models like PaliGemma 2 and Florence-2

maestro: Streamlining Fine-Tuning for Multimodal Models like PaliGemma 2 and Florence-2

maestro is a powerful tool designed to accelerate the fine-tuning process for multimodal models. It encapsulates best practices, handling configuration, data loading, reproducibility, and training loop setup efficiently. The project currently offers ready-to-use recipes for popular vision-language models, including Florence-2, PaliGemma 2, and Qwen2.5-VL.

Mar 2, 2026
View Details
KBLaM: Knowledge Base Augmented Language Models for Enhanced LLMs

KBLaM: Knowledge Base Augmented Language Models for Enhanced LLMs

KBLaM, developed by Microsoft, is the official implementation of "Knowledge Base Augmented Language Models" presented at ICLR 2025. This innovative method enhances Large Language Models by directly integrating external knowledge bases, offering an efficient alternative to traditional Retrieval-Augmented Generation (RAG) and in-context learning. It eliminates external retrieval modules and scales computationally linearly with knowledge base size, rather than quadratically.

Feb 28, 2026
View Details
Verifiers: Environments for LLM Reinforcement Learning and Evaluation

Verifiers: Environments for LLM Reinforcement Learning and Evaluation

Verifiers is a Python library by Prime Intellect AI for building environments to train and evaluate Large Language Models (LLMs). It enables the creation of custom environments with datasets, model harnesses, and reward functions, supporting reinforcement learning, capability evaluation, and synthetic data generation. This library is tightly integrated with the Prime Intellect ecosystem, including their Environments Hub and training framework.

Feb 14, 2026
View Details
TEN VAD: Low-Latency, High-Performance Voice Activity Detector

TEN VAD: Low-Latency, High-Performance Voice Activity Detector

TEN VAD is a low-latency, high-performance, and lightweight Voice Activity Detector (VAD) designed for real-time enterprise use. It provides accurate frame-level speech activity detection, outperforming common alternatives like WebRTC VAD and Silero VAD. This system is crucial for enhancing conversational AI by reducing end-to-end latency and improving speech segment extraction.

Feb 14, 2026
View Details
LLMSanitize: An Open-Source Library for Contamination Detection in NLP and LLM Datasets

LLMSanitize: An Open-Source Library for Contamination Detection in NLP and LLM Datasets

LLMSanitize is an open-source Python library designed for detecting contamination in NLP datasets and Large Language Models (LLMs). It offers a comprehensive suite of methods, ranging from string matching to model likelihood and embedding similarity, to ensure data integrity. This tool is crucial for researchers and developers working with LLMs to maintain the reliability of their models and evaluations.

Feb 9, 2026
View Details
AIMET: Advanced Quantization and Compression for Neural Networks

AIMET: Advanced Quantization and Compression for Neural Networks

AIMET, the AI Model Efficiency Toolkit, is an open-source Python library developed by Qualcomm Innovation Center, Inc. It provides advanced techniques for quantizing and compressing trained deep learning models. This toolkit helps improve runtime performance and reduce memory footprint, making models more efficient for deployment on edge devices while minimizing accuracy loss.

Feb 3, 2026
View Details
Page 1