asta-paper-finder: A Frozen-in-Time Agent for Reproducing Paper Finder Evaluations
This repository profile is provided by osrepos.com, an open source repository discovery platform.
Summary
asta-paper-finder is a standalone, "frozen-in-time" version of the AllenAI Paper Finder agent. This repository provides the code specifically for reproducing evaluation results, allowing researchers to locate sets of papers based on content and metadata criteria. It offers a stable snapshot of the agent's core paper-finding capabilities.
Introduction
The asta-paper-finder repository hosts a standalone, "frozen-in-time" version of the AllenAI Paper Finder agent. This release is intended for reproducing evaluation results, so it offers a stable, consistent environment in which to assess the agent's performance. Paper Finder helps users locate relevant academic papers by matching content-based and metadata criteria through a pipeline of manually coded components and LLM decisions. This version covers only the core single-turn paper search functionality and is distinct from the actively maintained live version, but it gives clear insight into the agent's underlying logic and capabilities.
Installation
To set up and run the asta-paper-finder agent, follow these steps:
Secrets File
The agent requires several API keys. Create a .env.secret file within agents/mabool/api/conf and define the following keys:
- OPENAI_API_KEY
- S2_API_KEY
- COHERE_API_KEY
- GOOGLE_API_KEY
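Before starting the server, it can help to confirm that the secrets file defines all four keys. The following is a minimal sketch, not part of the repository: it assumes the file uses plain KEY=value lines, and the helper names parse_env_text and missing_keys are hypothetical.

```python
from pathlib import Path

# Key names as listed above. The KEY=value file format is an assumption.
REQUIRED_KEYS = ("OPENAI_API_KEY", "S2_API_KEY", "COHERE_API_KEY", "GOOGLE_API_KEY")


def parse_env_text(text: str) -> dict[str, str]:
    """Parse simple KEY=value lines, skipping blank lines and # comments."""
    values: dict[str, str] = {}
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(text: str) -> list[str]:
    """Return the required keys that are absent or empty, in listed order."""
    values = parse_env_text(text)
    return [key for key in REQUIRED_KEYS if not values.get(key)]


if __name__ == "__main__":
    # Path taken from the instructions above, relative to the repository root.
    secret_file = Path("agents/mabool/api/conf/.env.secret")
    if not secret_file.exists():
        print(f"{secret_file} not found")
    else:
        missing = missing_keys(secret_file.read_text())
        print("Missing keys:", ", ".join(missing) if missing else "none")
```

Run from the repository root, the script reports which keys still need a value. It only checks that each key is present and non-empty, not that the key is valid.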
Environment
The project uses uv for dependency management. Prepare your environment by running:
make sync-dev
Running the Agent
Navigate to the API directory and start the FastAPI server:
cd agents/mabool/api
make start-dev
Once the server is running, you can interact with it via cURL or the Swagger web interface.
Examples
The agent exposes a POST endpoint, /api/2/rounds, for paper search. It accepts a JSON payload with the following arguments:
{
  "paper_description": "string",
  "operation_mode": "infer",
  "inserted_before": "string",
  "read_results_from_cache": false
}
- paper_description (REQUIRED): A natural language description of the papers you are searching for.
- operation_mode (default="infer"): Specifies the search intensity. Options are "infer", "fast", or "diligent". "infer" and "fast" perform a quick search (approx. 30 seconds), while "diligent" conducts a more exhaustive search (approx. 3 minutes).
- inserted_before (default=None): An optional upper-bound date in YYYY-MM-DD format to limit results to papers published before this date.
- read_results_from_cache (default=False): If set to true, the agent will attempt to return results from a disk-based cache if available.
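The arguments above can be assembled and sent from Python using only the standard library. This is a hedged sketch rather than an official client: the base URL http://localhost:8000 is an assumption (this profile does not state the host or port), the function names build_payload and find_papers are hypothetical, and the response is assumed to be JSON with no particular schema.

```python
import json
import urllib.request
from datetime import datetime

# Hypothetical base URL: the host and port are not stated in this profile.
BASE_URL = "http://localhost:8000"
OPERATION_MODES = ("infer", "fast", "diligent")


def build_payload(paper_description, operation_mode="infer",
                  inserted_before=None, read_results_from_cache=False):
    """Validate the documented arguments and return the JSON-ready payload."""
    if not paper_description or not paper_description.strip():
        raise ValueError("paper_description is required")
    if operation_mode not in OPERATION_MODES:
        raise ValueError(f"operation_mode must be one of {OPERATION_MODES}")
    payload = {
        "paper_description": paper_description,
        "operation_mode": operation_mode,
        "read_results_from_cache": bool(read_results_from_cache),
    }
    if inserted_before is not None:
        # Rejects strings that do not parse as a YYYY-MM-DD date.
        datetime.strptime(inserted_before, "%Y-%m-%d")
        payload["inserted_before"] = inserted_before
    return payload


def find_papers(payload, base_url=BASE_URL, timeout=300.0):
    """POST the payload to /api/2/rounds and return the decoded response.

    The generous timeout leaves room for a "diligent" search (approx. 3 minutes).
    """
    request = urllib.request.Request(
        f"{base_url}/api/2/rounds",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(request, timeout=timeout) as response:
        # Assumes a JSON response body; the schema is not documented here.
        return json.loads(response.read().decode("utf-8"))
```

With the server running, a call such as find_papers(build_payload("papers on retrieval-augmented generation", operation_mode="fast")) would submit a quick search. Validating the payload locally avoids waiting on a round trip for a malformed request.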
Why Use asta-paper-finder?
asta-paper-finder is aimed at researchers and developers interested in the core mechanics of an LLM-powered paper-finding agent. Because the code is "frozen in time", its behavior stays stable and consistent, which makes it well suited for:
- Reproducibility: Accurately reproduce and verify evaluation results reported for the Paper Finder agent.
- Understanding Core Logic: Study the pipeline of components, LLM decisions, and relevance judgments without the complexities of a live, evolving product.
- Benchmarking: Use a stable version for comparative analysis and developing new benchmarks for paper-seeking agents.
This repository offers a focused view on the single-turn paper search functionality, stripped of multi-turn interaction, UI, and production environment integrations, providing a clean slate for academic and experimental purposes.
Links
- GitHub Repository: https://github.com/allenai/asta-paper-finder
- Live PaperFinder Agent: https://paperfinder.allen.ai/