ArXiv MCP Server
MCP ServerFreeSearch and read arXiv academic papers and abstracts via MCP.
Capabilities9 decomposed
arxiv paper search with category and date filtering
Medium confidenceExecutes structured queries against the arXiv API using the arxiv Python client library, supporting keyword search combined with category filters (cs.AI, physics.*, etc.) and date range constraints. The search_papers tool normalizes user queries into arXiv query syntax, handles pagination for large result sets, and returns metadata including title, authors, publication date, and abstract. Results are streamed back to the MCP client without requiring local storage, enabling real-time discovery workflows.
Integrates directly with arXiv's native API client library rather than web scraping, enabling reliable pagination and category filtering. The MCP wrapper normalizes search parameters into arXiv query syntax, abstracting protocol complexity from AI assistants while maintaining full access to arXiv's filtering capabilities.
More reliable and maintainable than web scraping approaches; provides native category and date filtering that semantic search tools cannot offer without additional ML infrastructure.
pdf to markdown conversion with metadata preservation
Medium confidenceDownloads papers from arXiv as PDFs and converts them to Markdown format using the pymupdf4llm library, which extracts text, preserves structural hierarchy (headers, lists, tables), and maintains reading order. The download_paper tool retrieves the PDF via arXiv's direct download endpoint, processes it locally, and stores the Markdown output in a configurable local directory. Metadata (title, authors, abstract, arXiv ID) is embedded as YAML frontmatter in the Markdown file for downstream processing.
Uses pymupdf4llm specifically designed for LLM-friendly PDF conversion, preserving document structure and hierarchy rather than naive text extraction. Embeds paper metadata as YAML frontmatter, enabling downstream tools to access citation information without separate API calls.
Produces LLM-optimized Markdown with preserved structure, unlike generic PDF-to-text tools; local caching eliminates repeated arXiv downloads, reducing latency and API load compared to on-demand conversion approaches.
local paper inventory management with metadata indexing
Medium confidenceMaintains a local directory of downloaded papers with automatic metadata indexing. The list_papers tool scans the storage directory, parses YAML frontmatter from Markdown files, and returns a structured inventory including title, authors, publication date, arXiv ID, and file path. This enables quick discovery of previously downloaded papers without API calls and supports filtering/sorting operations on the local collection.
Implements lightweight metadata indexing by parsing YAML frontmatter from locally stored Markdown files, avoiding the need for a separate database while maintaining queryable inventory. Integrates with the download_paper tool's storage pattern, creating a cohesive local knowledge base without external dependencies.
Simpler and more portable than database-backed solutions; metadata is human-readable and version-controllable, enabling easy integration with version control systems and collaborative workflows.
paper content retrieval with structured access
Medium confidenceRetrieves the full Markdown content of previously downloaded papers from local storage via the read_paper tool. The tool accepts an arXiv ID or file path, loads the Markdown file, and returns the complete content including YAML frontmatter and converted paper text. This enables AI assistants to analyze paper content in subsequent prompts without re-downloading or re-converting, supporting multi-turn analysis workflows.
Provides direct file-based access to locally stored papers without re-fetching from arXiv, enabling fast retrieval and reducing API load. Integrates with the download_paper and list_papers tools to form a complete local paper management pipeline.
Faster than re-downloading from arXiv; supports multi-turn analysis workflows where papers are accessed repeatedly across different prompts without network overhead.
deep paper analysis prompt with structured interpretation workflow
Medium confidenceProvides a specialized MCP prompt (deep-paper-analysis) that guides AI assistants through a structured workflow for analyzing academic papers. The prompt defines a multi-step process: extracting key contributions, identifying methodology, analyzing results, and synthesizing implications. When invoked, the prompt system passes the paper content (typically loaded via read_paper) to the LLM with explicit instructions for structured analysis, enabling consistent interpretation across different papers and analysis sessions.
Implements a reusable MCP prompt template that standardizes paper analysis across multiple papers and sessions, avoiding prompt engineering overhead. The prompt is versioned and managed within the MCP server, enabling consistent interpretation without requiring users to maintain separate prompt files.
Provides structured analysis without requiring users to engineer custom prompts; enables reproducible analysis workflows across teams and sessions compared to ad-hoc prompting approaches.
mcp protocol server with tool and prompt registration
Medium confidenceImplements a complete MCP (Model Context Protocol) server that registers and exposes paper management tools (search_papers, download_paper, list_papers, read_paper) and analysis prompts (deep-paper-analysis) to MCP-compatible clients. The server uses the mcp Python library to handle protocol compliance, manages stdio-based communication with clients, and routes tool calls to appropriate handlers. The server layer (src/arxiv_mcp_server/server.py) handles command parsing, response formatting, and error handling according to MCP specification.
Implements full MCP protocol compliance using the official mcp Python library, handling stdio communication, tool registration, and response formatting according to specification. The modular architecture separates server protocol handling from tool implementation, enabling easy addition of new tools without modifying core server logic.
Standards-based MCP implementation ensures compatibility with any MCP-compatible client; cleaner integration than custom API wrappers, with built-in protocol handling and error management.
configurable local paper storage with directory management
Medium confidenceManages a configurable local directory for storing downloaded papers in Markdown format. The storage system is configured via environment variables or configuration files, with a default location that can be overridden. The download_paper tool writes converted papers to this directory with consistent naming (arXiv ID-based), and list_papers/read_paper tools read from the same directory. The architecture supports multiple storage backends through configuration, enabling flexibility in deployment scenarios.
Implements flexible storage configuration through environment variables, enabling deployment across different environments (local development, Docker containers, cloud instances) without code changes. The modular design separates storage concerns from tool logic, supporting future extensions to alternative storage backends.
Configuration-driven approach enables easy deployment customization; local filesystem storage is simpler and more portable than database-backed solutions, with human-readable file organization.
async-first mcp server implementation with non-blocking i/o
Medium confidenceImplements the MCP server using Python's asyncio framework for non-blocking I/O operations, enabling concurrent handling of multiple tool calls and client requests. The server architecture uses async/await patterns throughout the tool implementations (search_papers, download_paper, list_papers, read_paper), allowing long-running operations (PDF downloads, conversions) to proceed without blocking other client requests. This enables responsive multi-turn conversations where users can trigger multiple paper downloads or searches in parallel.
Uses Python asyncio throughout the server implementation, enabling non-blocking I/O for all paper operations. The async-first design allows concurrent handling of multiple tool calls, improving responsiveness in multi-turn conversations and supporting parallel workflows.
Async implementation enables responsive handling of concurrent requests without thread management overhead; better suited to I/O-bound operations like API calls and file I/O compared to synchronous approaches.
research discovery prompt with guided exploration workflow
Medium confidenceProvides a specialized MCP prompt (research-discovery) that guides AI assistants through discovering relevant research papers based on initial research questions or topics. The prompt defines a workflow for iterative search refinement: starting with broad searches, analyzing results, identifying gaps, and refining search queries. When invoked, the prompt system provides instructions for using the search_papers tool effectively, enabling systematic exploration of the arXiv repository without requiring users to manually craft complex search queries.
Implements a reusable MCP prompt that guides iterative search refinement, enabling systematic exploration without requiring users to understand arXiv's query syntax or research domain structure. The prompt is versioned within the MCP server, ensuring consistent discovery workflows across sessions.
Provides guided discovery without requiring domain expertise or manual query engineering; enables systematic exploration compared to ad-hoc searching, improving coverage of relevant papers.
Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.
Related Artifactssharing capabilities
Artifacts that share capabilities with ArXiv MCP Server, ranked by overlap. Discovered automatically through the match graph.
arxiv-mcp-server
A Model Context Protocol server for searching and analyzing arXiv papers
daily-arXiv-ai-enhanced
Automatically crawl arXiv papers daily and summarize them using AI. Illustrating them using GitHub Pages.
Consensus
Consensus is a search engine that uses AI to find answers in scientific research.
Explainpaper
A better way to read academic papers. Upload a paper, highlight confusing text, get an explanation.
Latex MCP Server
** - MCP Server to compile latex, download/organize/read cited papers, run visualization scripts and add figures/tables to latex.
scite
A platform for discovering and evaluating scientific articles.
Best For
- ✓AI researchers and developers building literature review tools
- ✓Teams building research discovery agents that need programmatic arXiv access
- ✓Academic institutions automating paper collection workflows
- ✓AI agents that need to analyze paper content within token-limited contexts
- ✓Research teams building local knowledge bases of academic papers
- ✓Developers creating paper analysis pipelines that require structured text input
- ✓Researchers maintaining personal paper libraries
- ✓Teams building persistent knowledge bases across multiple analysis sessions
Known Limitations
- ⚠arXiv API rate limits apply (approximately 3 requests per second per IP)
- ⚠Search results limited to arXiv's native query syntax — cannot perform semantic similarity searches
- ⚠Abstracts only; full paper content requires separate download operation
- ⚠No support for advanced boolean operators beyond arXiv's native AND/OR syntax
- ⚠PDF conversion quality varies by paper format — complex layouts, multi-column text, and embedded images may not convert perfectly
- ⚠Large papers (100+ pages) produce Markdown files exceeding typical LLM context windows; requires chunking or summarization
Requirements
Input / Output
UnfragileRank
UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.
About
Community MCP server for arXiv academic paper repository. Provides tools to search papers by topic, read abstracts and metadata, download PDFs, and query recent submissions in specific categories.
Categories
Alternatives to ArXiv MCP Server
Are you the builder of ArXiv MCP Server?
Claim this artifact to get a verified badge, access match analytics, see which intents users search for, and manage your listing.
Get the weekly brief
New tools, rising stars, and what's actually worth your time. No spam.
Data Sources
Looking for something else?
Search →