{"name":"AI-Scientist-v2: Automated Scientific Discovery via Agentic Tree Search","description":"AI-Scientist-v2 is an advanced agentic system designed for automated scientific discovery, capable of generating hypotheses, running experiments, analyzing data, and writing scientific manuscripts. This system has successfully produced the first workshop paper written entirely by AI and accepted through peer review, marking a significant step towards fully autonomous research.","github":"https://github.com/SakanaAI/AI-Scientist-v2","url":"https://osrepos.com/repo/sakanaai-ai-scientist-v2","source":"osrepos.com","sourceDescription":"This repository profile is provided by osrepos.com, an open source repository discovery platform.","repositoryProfile":"https://osrepos.com/repo/sakanaai-ai-scientist-v2","generatedFor":"open source discovery and AI-assisted research","markdown":"https://osrepos.com/repo/sakanaai-ai-scientist-v2.md","json":"https://osrepos.com/repo/sakanaai-ai-scientist-v2.json","topics":["Python","AI","Machine Learning","Scientific Discovery","Automated Research","Agent Systems","LLMs","Deep Learning"],"keywords":["Python","AI","Machine Learning","Scientific Discovery","Automated Research","Agent Systems","LLMs","Deep Learning"],"stars":null,"summary":"AI-Scientist-v2 is an advanced agentic system designed for automated scientific discovery, capable of generating hypotheses, running experiments, analyzing data, and writing scientific manuscripts. This system has successfully produced the first workshop paper written entirely by AI and accepted through peer review, marking a significant step towards fully autonomous research.","content":"## Introduction\n\nThe AI Scientist-v2, developed by SakanaAI, represents a groundbreaking step in automated scientific discovery. This generalized, end-to-end agentic system is designed to autonomously conduct scientific research, from generating novel hypotheses to writing peer-reviewed papers. It distinguishes itself from its predecessor by removing reliance on human-authored templates, generalizing across various Machine Learning (ML) domains, and employing a progressive agentic tree search guided by an experiment manager agent.\n\nThis system has already achieved a notable milestone, generating the first workshop paper written entirely by AI and accepted through peer review. While AI Scientist-v2 takes a broader, more exploratory approach compared to v1, it is ideal for open-ended scientific exploration, pushing the boundaries of what AI can achieve in research.\n\n## Installation\n\nTo get started with AI Scientist-v2, you'll need a Linux environment with NVIDIA GPUs, CUDA, and PyTorch. The installation process involves setting up a Conda environment and installing necessary dependencies.\n\n1.  **Create a Conda environment:**\n    bash\n    conda create -n ai_scientist python=3.11\n    conda activate ai_scientist\n    \n\n2.  **Install PyTorch with CUDA support:** (Adjust `pytorch-cuda` version for your setup)\n    bash\n    conda install pytorch torchvision torchaudio pytorch-cuda=12.4 -c pytorch -c nvidia\n    \n\n3.  **Install PDF and LaTeX tools:**\n    bash\n    conda install anaconda::poppler\n    conda install conda-forge::chktex\n    \n\n4.  **Install Python package requirements:**\n    bash\n    pip install -r requirements.txt\n    \n\nEnsure you set up API keys for supported models (OpenAI, Gemini, Claude via AWS Bedrock) and optionally for Semantic Scholar for enhanced literature search. Refer to the official repository for detailed instructions on API key configuration.\n\n## Examples\n\nThe AI Scientist-v2 workflow typically involves two main stages: generating research ideas and then running paper generation experiments.\n\n### 1. Generate Research Ideas\n\nFirst, you use the `perform_ideation_temp_free.py` script to brainstorm and refine research ideas based on a high-level topic description you provide. This script leverages LLMs and tools like Semantic Scholar to check for novelty.\n\n**Example Command:**\n\nbash\npython ai_scientist/perform_ideation_temp_free.py \\\n --workshop-file \"ai_scientist/ideas/my_research_topic.md\" \\\n --model gpt-4o-2024-05-13 \\\n --max-num-generations 20 \\\n --num-reflections 5\n\n\nThis will generate a JSON file containing structured research ideas, which will be used in the next step.\n\n### 2. Run AI Scientist-v2 Paper Generation Experiments\n\nOnce you have your research ideas, you can launch the main pipeline to run experiments via agentic tree search, analyze results, and generate a paper draft.\n\n**Example Command:**\n\nbash\npython launch_scientist_bfts.py \\\n --load_ideas \"ai_scientist/ideas/my_research_topic.json\" \\\n --load_code \\\n --add_dataset_ref \\\n --model_writeup o1-preview-2024-09-12 \\\n --model_citation gpt-4o-2024-11-20 \\\n --model_review gpt-4o-2024-11-20 \\\n --model_agg_plots o3-mini-2025-01-31 \\\n --num_cite_rounds 20\n\n\nAfter completion, you will find a timestamped log folder in `experiments/` containing the tree visualization and, eventually, the generated PDF paper.\n\n## Why Use It\n\nAI Scientist-v2 offers a powerful platform for accelerating scientific discovery and exploring new research frontiers. Its key advantages include:\n\n*   **Full Autonomy:** It automates the entire research lifecycle, from hypothesis generation to paper writing, significantly reducing human effort and time.\n*   **Generalization:** Unlike previous versions, it can generalize across various ML domains, making it a versatile tool for diverse research areas.\n*   **Agentic Tree Search:** The progressive agentic tree search, guided by an experiment manager, allows for more exploratory and less template-dependent research.\n*   **Pioneering AI Research:** It represents a significant milestone in AI's capability to contribute to scientific literature, having produced a peer-reviewed paper entirely by AI.\n\n**Caution:** This codebase will execute Large Language Model (LLM)-written code. It is crucial to run this within a controlled sandbox environment (e.g., a Docker container) due to potential risks like dangerous packages or unintended processes.\n\n## Links\n\n*   **GitHub Repository:** [https://github.com/SakanaAI/AI-Scientist-v2](https://github.com/SakanaAI/AI-Scientist-v2){:target=\"_blank\"}\n*   **Paper:** [https://pub.sakana.ai/ai-scientist-v2/paper](https://pub.sakana.ai/ai-scientist-v2/paper){:target=\"_blank\"}\n*   **Blog Post:** [https://sakana.ai/ai-scientist-first-publication/](https://sakana.ai/ai-scientist-first-publication/){:target=\"_blank\"}\n*   **ICLR2025 Workshop Experiment:** [https://github.com/SakanaAI/AI-Scientist-ICLR2025-Workshop-Experiment](https://github.com/SakanaAI/AI-Scientist-ICLR2025-Workshop-Experiment){:target=\"_blank\"}","metrics":{"detailViews":6,"githubClicks":3},"dates":{"published":null,"modified":"2026-05-08T23:38:14.000Z"}}