OSRepos
Discover open source projects through curated analysis, useful topics, and repository deep dives.

Featured repository
ChatArena: Multi-Agent Language Game Environments for LLMs
ChatArena is a Python library designed to provide multi-agent language game environments for Large Language Models (LLMs), aiming to foster the development of communication and collaboration capabilities in AI. It offers a flexible framework for defining players, environments, and interactions based on Markov Decision Processes. Please note that as of August 11, 2025, this project has been deprecated due to a lack of widespread community use and is no longer receiving updates or support.
Explore by topic
Jump into the most common areas across analyzed repositories.
Recently analyzed
Fresh repository analysis from the OSRepos archive.

ChatArena: Multi-Agent Language Game Environments for LLMs
ChatArena is a Python library designed to provide multi-agent language game environments for Large Language Models (LLMs), aiming to foster the development of communication and collaboration capabilities in AI. It offers a flexible framework for defining players, environments, and interactions based on Markov Decision Processes. Please note that as of August 11, 2025, this project has been deprecated due to a lack of widespread community use and is no longer receiving updates or support.
Agentarium: A Python Framework for AI Agent Simulations
Agentarium is an open-source Python framework designed for creating and managing simulations with AI-powered agents. It offers an intuitive platform for designing complex, interactive environments where agents can act, learn, and evolve. This powerful tool simplifies the orchestration of multiple AI agents and their interactions.
Lighteval: Your All-in-One Toolkit for LLM Evaluation
Lighteval is a comprehensive toolkit from Hugging Face for evaluating Large Language Models (LLMs) across various backends. It enables users to dive deep into model performance by saving detailed, sample-by-sample results and supports over 1000 evaluation tasks. The framework offers extensive customization options, allowing users to create custom tasks and metrics tailored to their specific needs.

PromptBench: A Unified Framework for LLM Evaluation and Robustness
PromptBench is a comprehensive Python library designed for the evaluation and understanding of Large Language Models (LLMs). It provides a unified framework for assessing model performance, exploring various prompt engineering techniques, and evaluating robustness against adversarial attacks. This tool empowers researchers to conduct in-depth analyses of LLMs across diverse datasets and models.

LangTest: A Comprehensive Library for Safe & Effective Language Models
LangTest is an open-source Python library dedicated to ensuring the safety and effectiveness of language models. It offers a comprehensive framework for testing model quality, covering robustness, bias, fairness, and accuracy across various NLP tasks and LLM providers. With LangTest, developers can generate and execute over 60 distinct test types with just one line of code, promoting responsible AI development.

EvalPlus: Rigorous Evaluation for LLM-Synthesized Code
EvalPlus is a robust framework designed for the rigorous evaluation of code generated by Large Language Models (LLMs). It extends standard benchmarks like HumanEval and MBPP with significantly more tests, offering precise assessment of code correctness and efficiency. This tool is crucial for developers and researchers aiming to thoroughly validate LLM-synthesized code.
Discover something different
A rotating sample from deeper in the archive.

Style Observer: Run JavaScript When CSS Properties Change
Style Observer is a robust, production-ready JavaScript library designed to detect and respond to changes in any CSS property. It allows developers to run custom JavaScript code when a CSS property changes, offering a powerful way to create dynamic and reactive web interfaces. The library handles browser inconsistencies and provides a lightweight, dependency-free solution for observing styles.

Zero: An Open-Source AI Email Solution Focused on Privacy and Self-Hosting
Zero is an innovative open-source email application designed to modernize email with AI agents, prioritizing user privacy and self-hosting capabilities. It allows users to integrate external services like Gmail while maintaining control over their data, offering a transparent, secure, and customizable email experience.

Inscribed: Create Stop Motion Animations and Slides with Excalidraw
Inscribed is an innovative, slide-based tool for creating stop motion animations and engaging presentations. It leverages Excalidraw to provide an interactive drawing canvas, supporting features like image import, Google Fonts, and GIF export. This versatile tool is perfect for visual storytelling and dynamic slide creation.
Stay Updated
Get notified about new repositories and updates. Join our community of developers!