Repository History

Explore all analyzed open source repositories

Topic: LLMs
AI-Scientist-v2: Automated Scientific Discovery via Agentic Tree Search

AI-Scientist-v2: Automated Scientific Discovery via Agentic Tree Search

AI-Scientist-v2 is an advanced agentic system designed for automated scientific discovery, capable of generating hypotheses, running experiments, analyzing data, and writing scientific manuscripts. This system has successfully produced the first workshop paper written entirely by AI and accepted through peer review, marking a significant step towards fully autonomous research.

May 9, 2026
View Details
LLMSanitize: An Open-Source Library for Contamination Detection in NLP and LLM Datasets

LLMSanitize: An Open-Source Library for Contamination Detection in NLP and LLM Datasets

LLMSanitize is an open-source Python library designed for detecting contamination in NLP datasets and Large Language Models (LLMs). It offers a comprehensive suite of methods, ranging from string matching to model likelihood and embedding similarity, to ensure data integrity. This tool is crucial for researchers and developers working with LLMs to maintain the reliability of their models and evaluations.

Feb 9, 2026
View Details
Transformer Lab App: An Open Source Platform for Frontier AI/ML Workflows

Transformer Lab App: An Open Source Platform for Frontier AI/ML Workflows

Transformer Lab App is an open-source machine learning research platform designed for frontier AI/ML workflows. It provides a comprehensive toolkit for large language models, allowing users to train, tune, and chat on their own machines, whether locally, on-prem, or in the cloud. Backed by Mozilla, this cross-platform application simplifies experimentation with a wide range of models.

Dec 31, 2025
View Details
GenerativeAICourse: A Comprehensive Hands-On Generative AI Engineering Course

GenerativeAICourse: A Comprehensive Hands-On Generative AI Engineering Course

This repository offers a comprehensive, hands-on Generative AI course, starting from fundamental AI concepts to building production-grade applications. It focuses on AI engineering, covering topics like LLMs, RAG, AI agents, and prompt engineering with practical tutorials. The course aims to equip learners with the skills needed to build real-world AI solutions.

Dec 4, 2025
View Details
Lance: Modern Columnar Data Format for ML and LLMs

Lance: Modern Columnar Data Format for ML and LLMs

Lance is a modern columnar data format, implemented in Rust, designed for machine learning and large language model workflows. It offers significant performance improvements over Parquet for random access, includes vector indexing, and supports data versioning. Compatible with popular tools like Pandas, DuckDB, and PyTorch, Lance streamlines data management for ML applications.

Nov 1, 2025
View Details
Instructor: Structured Outputs for LLMs with Pydantic and Python

Instructor: Structured Outputs for LLMs with Pydantic and Python

Instructor is a powerful Python library that simplifies extracting structured data from Large Language Models (LLMs). It integrates Pydantic for robust validation, type safety, and IDE support, eliminating the need for manual JSON parsing, error handling, and retries. This tool provides a streamlined and reliable way to get structured outputs from any LLM.

Oct 12, 2025
View Details
Stagehand: The AI Browser Automation Framework for Production

Stagehand: The AI Browser Automation Framework for Production

Stagehand is an innovative AI browser automation framework that expertly blends the precision of code with the adaptability of natural language. Designed for production environments, it empowers developers to choose between writing low-level Playwright code for specific tasks and leveraging high-level AI agents for dynamic interactions. This framework also enhances the automation process with features like action previewing, intelligent caching, and seamless integration with advanced computer use models.

Oct 11, 2025
View Details
Page 1