Audioscrape
MCP Server** - Search 1M+ hours of podcasts, interviews, talks and your private audio uploads with speaker identification and timestamps. Official Remote MCP server (via https://mcp.audioscrape.com) enabling AI assistants to access and analyze audio content through semantic and text-based search.
Capabilities8 decomposed
semantic and text-based audio search with speaker identification
Medium confidenceSearches across 1M+ hours of indexed podcast, interview, and talk audio content using dual search modes: text-based keyword matching for exact phrase discovery and semantic search for conceptual relevance. Returns segment-level results with speaker identification, precise timestamps (HH:MM:SS format), and relevance scoring (0-1 float). Implements pagination via offset/limit parameters (max 200 results per query) and supports sorting by relevance, publication date, or episode title. Results include direct URLs with timestamp anchors enabling one-click navigation to specific moments in audio.
Combines speaker identification with dual search modes (text + semantic) across 275,000+ pre-transcribed podcasts, returning segment-level results with precise timestamps and direct playback URLs. Unlike generic audio search, it indexes speaker identity and enables conceptual discovery across a curated corpus of 1M+ hours.
Faster and more accurate than manual podcast searching or generic web search because it operates on pre-transcribed, indexed audio with speaker metadata rather than requiring real-time transcription or relying on episode descriptions alone.
recent episode discovery with temporal and podcast filtering
Medium confidenceLists recently published podcast episodes with configurable lookback window (1-365 days, default 7 days) and optional filtering by specific podcast IDs. Returns structured episode metadata including title, podcast name, publication date (YYYY-MM-DD), duration in seconds, and direct episode URLs. Supports pagination via limit parameter (1-100 episodes per request). Designed as a lightweight alternative to full search for discovering fresh content within a time window.
Provides lightweight, time-windowed episode listing with optional podcast filtering, enabling efficient discovery of recent content without full-text search overhead. Optimized for agents that need to stay current with specific podcast feeds rather than search across the entire corpus.
More efficient than running broad searches for recent content because it directly indexes publication dates and returns only new episodes, avoiding the computational cost of semantic or text matching across the full 1M+ hour corpus.
episode transcript and metadata retrieval
Medium confidenceRetrieves complete episode content including full transcript, metadata (title, podcast, publication date, duration), and speaker information for a specified episode ID. Enables downstream processing of full episode context rather than segment-level search results. Implementation details are partially documented; full transcript retrieval mechanism and context window handling are not fully specified in available documentation.
Provides direct access to full episode transcripts with speaker identification and metadata, enabling AI models to process complete episode context rather than isolated search segments. Integrates with Audioscrape's 99.2% transcription accuracy and speaker identification pipeline.
More efficient than downloading raw audio and running local transcription because it returns pre-transcribed, speaker-identified content with timestamps, saving compute time and enabling immediate downstream processing.
mcp-based tool integration for ai assistants
Medium confidenceExposes Audioscrape's audio search and retrieval capabilities as standardized MCP (Model Context Protocol) tools, enabling Claude, other LLM-based assistants, and AI agents to call audio search functions natively without custom API integration code. Implements OAuth 2.0 authentication with dynamic client registration following MCP spec 6/18. All tools are read-only (no mutation capabilities). Server endpoint is mcp.audioscrape.com, supporting remote MCP connections from any MCP-compatible client.
Provides standardized MCP tool bindings for audio search, enabling AI assistants to call Audioscrape functions as native tools without custom API integration. Uses OAuth 2.0 dynamic client registration for secure, user-specific authentication within MCP framework.
Simpler than building custom API clients because it leverages MCP's standardized tool protocol, allowing Claude and other MCP-compatible assistants to call audio search functions with zero custom integration code. Enables natural language queries to be translated directly to structured audio searches.
plan-based rate limiting and quota management
Medium confidenceImplements tiered subscription plans (Free, Basic, Pro, Enterprise) with explicit monthly quotas for searches, API calls, and transcription minutes. Free plan: 10 searches/month, 50 transcription minutes/month. Basic plan: 50 searches/month, 50 API calls/month, 1000 transcription minutes/month. Pro plan: unlimited searches, 1000 API calls/month, 5000 transcription minutes/month. Enterprise: unlimited access. Rate limiting is enforced server-side at the MCP endpoint; quota consumption is tracked per API key and reset monthly.
Implements multi-dimensional quota system (searches, API calls, transcription minutes) across four subscription tiers, with monthly reset cycles. Quota enforcement is server-side at the MCP endpoint, preventing quota-aware clients from needing local tracking.
More transparent than usage-based pricing because quotas are fixed and predictable per plan, enabling builders to estimate costs upfront. Simpler than per-request metering because quota resets monthly rather than requiring real-time billing calculations.
private audio upload and indexing
Medium confidenceEnables users to upload private audio files (meetings, calls, proprietary recordings) for indexing and search within their own Audioscrape account. Uploaded audio is transcribed, speaker-identified, and indexed using the same pipeline as public podcasts, making it searchable via the standard search_audio_content tool. Private uploads are isolated to the uploading user's account and not visible to other users. Transcription of private audio consumes the user's monthly transcription minute quota.
Extends Audioscrape's indexing pipeline to user-uploaded private audio, enabling unified search across public podcasts and proprietary content. Private uploads are isolated per user and consume the user's transcription quota, creating a hybrid public/private search experience.
More integrated than managing separate transcription and search systems because private uploads use the same indexing and search infrastructure as public podcasts, enabling single-query search across both sources without custom integration.
multi-dimensional result filtering and sorting
Medium confidenceSupports filtering search results by podcast IDs, publication date range (date_from/date_to in YYYY-MM-DD format), and recency (last_week, last_month, last_year enum). Sorting options include relevance (default), publication date, and episode title, with ascending or descending order. Filters are applied server-side during search execution, reducing result set before returning to client. Pagination via offset/limit enables iterating through filtered results.
Provides server-side filtering and sorting across multiple dimensions (podcast, date, recency, relevance), reducing client-side processing and enabling efficient result refinement without fetching full result sets.
More efficient than client-side filtering because filters are applied at the server during query execution, reducing data transfer and processing latency compared to fetching all results and filtering locally.
contextual segment retrieval with surrounding content
Medium confidenceOptional include_context parameter in search_audio_content enables retrieval of surrounding audio segments adjacent to matched results, providing narrative context around search hits. When enabled, results include not just the matched segment but also preceding and following segments from the same episode, enabling AI models to understand broader context without requiring full episode retrieval. Context window size is not documented.
Enables optional retrieval of surrounding segments adjacent to search matches, providing narrative context without requiring full episode transcripts. Reduces latency compared to full episode retrieval while providing more context than isolated segment matches.
More efficient than full episode retrieval because it returns only relevant segments plus immediate context, reducing data transfer and processing overhead while still providing sufficient context for AI reasoning.
Capabilities are decomposed by AI analysis. Each maps to specific user intents and improves with match feedback.
Related Artifactssharing capabilities
Artifacts that share capabilities with Audioscrape, ranked by overlap. Discovered automatically through the match graph.
AIPODNAV
AI-enhanced podcast navigation with transcripts, summaries, chapters, mind...
Deciphr Ai
Transform podcasts into engaging blogs, captions, and videos...
Listener.fm
Elevate Your Podcast Post-Production...
Synthesis Youtube
Web-based video segment search engine for...
Speechmatics
Speechmatics is a speech-to-text technology that accurately converts audio files into text, enabling users to search, analyze, and organize their audio...
Scribewave
AI-Powered Transcription and Language...
Best For
- ✓AI assistants and agents needing to ground responses in audio evidence
- ✓Researchers analyzing discourse patterns across multiple podcasts
- ✓Content creators finding clips or references for derivative works
- ✓Teams building knowledge bases from unstructured audio content
- ✓AI agents needing current episode context for recommendations or summaries
- ✓Content aggregation systems tracking specific podcast networks
- ✓Notification systems alerting users to new episodes in followed podcasts
- ✓Researchers studying recent discourse trends in specific audio channels
Known Limitations
- ⚠Free plan limited to 10 searches/month; Basic plan 50 searches/month; Pro/Enterprise required for high-volume access
- ⚠Semantic search accuracy depends on transcription quality (99.2% claimed for clear audio, lower for noisy recordings)
- ⚠Results capped at 200 per query; discovering long-tail content requires multiple paginated requests
- ⚠Search limited to indexed content only; real-time audio or very recent uploads may not be immediately searchable
- ⚠No relevance tuning parameters exposed; ranking algorithm is opaque
- ⚠Limited to 7-365 day lookback window; cannot retrieve episodes older than 365 days
Requirements
Input / Output
UnfragileRank
UnfragileRank is computed from adoption signals, documentation quality, ecosystem connectivity, match graph feedback, and freshness. No artifact can pay for a higher rank.
About
** - Search 1M+ hours of podcasts, interviews, talks and your private audio uploads with speaker identification and timestamps. Official Remote MCP server (via https://mcp.audioscrape.com) enabling AI assistants to access and analyze audio content through semantic and text-based search.
Categories
Alternatives to Audioscrape
Are you the builder of Audioscrape?
Claim this artifact to get a verified badge, access match analytics, see which intents users search for, and manage your listing.
Get the weekly brief
New tools, rising stars, and what's actually worth your time. No spam.
Data Sources
Looking for something else?
Search →