arxiv-mcp-server
MCP ServerFreeA Model Context Protocol server for searching and analyzing arXiv papers
Capabilities8 decomposed
arxiv paper full-text search with query parsing
Medium confidenceImplements MCP tool interface to query arXiv's REST API with support for advanced search syntax (author, title, category filters, date ranges). Parses user natural language queries into arXiv API query strings, handles pagination, and returns structured metadata including abstracts, authors, publication dates, and PDF URLs. Uses HTTP client to communicate with arXiv's public API endpoint without authentication.
Exposes arXiv search as an MCP tool callable by Claude/GPT, enabling LLMs to autonomously discover papers without context switching; integrates query parsing to translate natural language into arXiv's advanced search syntax
Tighter integration with LLM workflows than direct arXiv API calls, and more discoverable than browser-based search for AI agents
paper metadata extraction and structured formatting
Medium confidenceParses arXiv API JSON responses and extracts key metadata fields (title, authors, abstract, publication date, categories, PDF URL, arXiv ID) into a consistent structured format. Formats results for readability in chat contexts, handling multi-author lists, category hierarchies, and URL encoding. Implements field mapping to normalize arXiv's native response schema into a developer-friendly output structure.
Normalizes arXiv's native API response into a consistent schema optimized for LLM consumption, with special handling for multi-author lists and category hierarchies that are common in academic papers
More structured than raw arXiv API responses and more accessible to LLMs than unformatted text, enabling downstream agents to reliably parse and act on paper metadata
mcp tool registration and schema definition
Medium confidenceRegisters arXiv search and retrieval functions as MCP tools with JSON Schema definitions that describe input parameters (query, filters, result limits) and output structure. Implements the MCP protocol's tool interface, allowing Claude, Cline, and other MCP clients to discover available tools, understand their parameters, and invoke them with proper type validation. Handles tool invocation routing and response serialization back to the MCP client.
Implements full MCP protocol compliance for tool registration, including JSON Schema validation and proper error handling, enabling seamless integration with Claude and other MCP clients without custom adapters
More standardized than custom API wrappers and more discoverable than direct function calls, allowing LLMs to autonomously understand and invoke arXiv search without hardcoded instructions
query parameter filtering and advanced search syntax translation
Medium confidenceTranslates natural language search queries into arXiv's advanced search syntax, supporting filters for author names, paper titles, publication date ranges, and arXiv categories (cs.AI, physics.quant-ph, etc.). Implements parameter validation and escaping to prevent API errors, handles multi-value filters (e.g., multiple authors OR'd together), and constructs properly formatted query strings for the arXiv API. Supports both simple keyword search and complex boolean queries.
Abstracts arXiv's non-intuitive query syntax from users, allowing natural language filter specifications that are automatically translated into valid arXiv API queries with proper escaping and validation
More user-friendly than requiring users to learn arXiv's query syntax directly, and more robust than naive string concatenation which can produce malformed queries
pagination and result batching for large result sets
Medium confidenceImplements pagination logic to handle arXiv API's result limits (typically 10-100 results per request), allowing users to retrieve large result sets across multiple API calls. Manages offset/limit parameters, accumulates results across batches, and provides mechanisms to control result count (e.g., 'get top 50 papers'). Handles empty result sets and API errors gracefully without losing previously fetched results.
Transparently handles arXiv's pagination constraints within the MCP tool interface, allowing users to request arbitrary result counts without manually managing offset/limit parameters
Simpler than manually constructing paginated API calls, and more efficient than fetching all results upfront which can exceed memory limits
error handling and api resilience with graceful degradation
Medium confidenceImplements error handling for common arXiv API failures (rate limiting, timeouts, malformed queries, network errors) with appropriate HTTP status code interpretation and user-friendly error messages. Provides retry logic with exponential backoff for transient failures, validates input parameters before API calls to prevent unnecessary requests, and returns partial results when possible rather than failing completely. Logs errors for debugging while maintaining MCP protocol compliance.
Implements MCP-aware error handling that preserves protocol compliance while providing retry logic and graceful degradation, ensuring the server remains responsive even when arXiv API is unreliable
More robust than naive API calls that fail immediately on errors, and more transparent than silent failures that leave users confused about why searches aren't working
context-aware paper recommendation based on search history
Medium confidenceTracks previous search queries and results within an MCP session, using this history to inform subsequent searches and recommendations. Analyzes patterns in user searches (e.g., frequently searched authors, categories, keywords) and suggests related papers or refined queries. Implements lightweight session state management to maintain search context across multiple tool invocations without requiring external persistence.
Maintains lightweight session-scoped context of search history within the MCP server, enabling recommendations and query refinement without requiring external knowledge bases or persistent storage
More contextual than stateless API calls, and simpler than full RAG systems while still providing some recommendation capability
abstract summarization and key insight extraction
Medium confidenceProcesses paper abstracts returned from arXiv searches and extracts key insights, research questions, and methodologies using pattern matching and NLP heuristics. Generates concise summaries suitable for quick scanning by researchers, highlighting novel contributions and relevance to search context. Integrates with Claude's native capabilities when available, delegating summarization to the LLM client rather than implementing custom NLP.
Delegates summarization to Claude when available (leveraging the LLM client's capabilities) while providing fallback heuristic-based extraction, avoiding redundant LLM calls and keeping the MCP server lightweight
More efficient than requiring separate LLM calls for each abstract, and more intelligent than simple keyword extraction
Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.
Related Artifactssharing capabilities
Artifacts that share capabilities with arxiv-mcp-server, ranked by overlap. Discovered automatically through the match graph.
@seacolour/openalex-mcp-server-tool
MCP server for querying OpenAlex papers
arxiv-paper
MCP server: arxiv-paper
Paper Search
Search and download academic papers from arXiv, PubMed, bioRxiv, medRxiv, Google Scholar, Semantic Scholar, and IACR. Fetch PDFs and extract full text to accelerate literature reviews. Get consistent metadata for easier filtering, citation, and analysis.
scholarmcp
MCP server: scholarmcp
BGPT MCP API
Search scientific papers with raw experimental data extracted from full-text studies. Returns methods, results, quality scores, and 25+ metadata fields per paper. 50 free searches, then $0.01/result with an API key.
arxiv-retrive
MCP server: arxiv-retrive
Best For
- ✓Researchers building LLM-powered literature review tools
- ✓AI agents that need to autonomously search academic papers
- ✓Claude/GPT users wanting integrated arXiv access without leaving their chat interface
- ✓Developers building knowledge management systems on top of arXiv
- ✓Teams creating literature review automation pipelines
- ✓LLM application builders needing normalized paper data structures
- ✓Claude Desktop users wanting native arXiv integration
- ✓Developers building multi-tool MCP agent systems
Known Limitations
- ⚠arXiv API rate-limited to ~3 requests per second per IP; no built-in backoff/retry logic visible
- ⚠Search results capped at arXiv's default pagination (typically 10-100 results per query)
- ⚠No full-text search of paper contents — only metadata (title, abstract, authors)
- ⚠Depends on arXiv API availability; no fallback or caching mechanism
- ⚠Only extracts metadata available via arXiv API — no full-text content extraction
- ⚠Author affiliations not included in standard arXiv API response
Requirements
Input / Output
UnfragileRank
UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.
Repository Details
Last commit: Apr 26, 2026
About
A Model Context Protocol server for searching and analyzing arXiv papers
Categories
Alternatives to arxiv-mcp-server
Search the Supabase docs for up-to-date guidance and troubleshoot errors quickly. Manage organizations, projects, databases, and Edge Functions, including migrations, SQL, logs, advisors, keys, and type generation, in one flow. Create and manage development branches to iterate safely, confirm costs
Compare →Are you the builder of arxiv-mcp-server?
Claim this artifact to get a verified badge, access match analytics, see which intents users search for, and manage your listing.
Get the weekly brief
New tools, rising stars, and what's actually worth your time. No spam.
Data Sources
Looking for something else?
Search →