ClawHire

Search & Data Extraction MCP Servers

130 MCP servers in the search & data extraction category. Click any server for install commands, Claude Code setup, and GitHub source.

Search1API

Search and crawl in one API

Search & Data Extraction
Vectorize

[Vectorize](https://vectorize.io) MCP server for advanced retrieval, Private Deep Research, Anything-to-Markdown file extraction and text chunking.

Search & Data Extraction
Clojars

Obtains latest dependency details for Clojure libraries.

Search & Data Extraction
Google News

Google News search capabilities with automatic topic categorization and multi-language support via SerpAPI integration.

Search & Data Extraction
GXtract

GXtract is a MCP server designed to integrate with VS Code and other compatible editors (documentation: [sascharo.github.io/gxtract](https://sascharo.github.io/gxtract)). It provides a suite of tools for interacting with the GroundX platform, enabling you to leverage its powerful document understanding capabilities directly within your development environment.

Search & Data Extraction
just-every/mcp-read-website-fast

Fast, token-efficient web content extraction that converts websites to clean Markdown. Features Mozilla Readability, smart caching, polite crawling with robots.txt support, and concurrent fetching with minimal dependencies.

Search & Data Extraction
just-every/mcp-screenshot-website-fast

High-quality screenshot capture optimized for Claude Vision API. Automatically tiles full pages into 1072x1072 chunks (1.15 megapixels) with configurable viewports and wait strategies for dynamic content.

Search & Data Extraction
Kagi

Kagi search API integration

Search & Data Extraction
Nexus

Web search server that integrates Perplexity Sonar models via OpenRouter API for real-time, context-aware search with citations

Search & Data Extraction
SearXNG

A Model Context Protocol Server for [SearXNG](https://docs.searxng.org)

Search & Data Extraction
Rippr

YouTube transcript extraction for AI agents. Clean text, timestamps, or structured JSON from any video. No API keys required. Install via `npx rippr-mcp`.

Search & Data Extractionnpx rippr-mcp
Job Searchoor

An MCP server for searching job listings with filters for date, keywords, remote work options, and more.

Search & Data Extraction
Aeo Cli

Audit URLs for AI crawler readiness β€” checks robots.txt, llms.txt, JSON-LD schema, and content density with 0-100 AEO scoring.

Search & Data Extraction
Open WebSearch

Web search using free multi-engine search (NO API KEYS REQUIRED) β€” Supports Bing, Baidu, DuckDuckGo, Brave, Exa, and CSDN.

Search & Data Extraction
MCPSerp

Google SERP search including web, images, news, maps, places, videos, and knowledge graph results via Ace Data Cloud API.

Search & Data Extraction
Markcrawl

Crawl websites into clean Markdown, search pages, and extract structured data with LLMs. Built-in MCP server for web research and RAG pipelines.

Search & Data Extraction
Webpage Screenshot Mcp

A MCP server for taking screenshots of webpages to use as feedback during UI developement.

Search & Data Extraction
Mcp Simple Arxiv

MCP for LLM to search and read papers from arXiv

Search & Data Extraction
Mcp Simple Pubmed

MCP to search and read medical / life sciences papers from PubMed.

Search & Data Extraction
Nyt

Search articles using the NYTimes API

Search & Data Extraction
Mcp Server Rag Web Browser

An MCP server for Apify's open-source RAG Web Browser Actor to perform web searches, scrape URLs, and return content in Markdown.

Search & Data Extraction
Mcp Server

Search 800 000+ Polish public tenders (BZP + TED). Profiles of procuring entities and contractors by NIP, market statistics by CPV/province, 90+ term procurement glossary.

Search & Data Extraction
Argus

Multi-provider search broker with automatic fallback, RRF ranking, content extraction, and budget enforcement.

Search & Data Extraction
Idapixl Web Research Mcp

Pay-per-use web research for AI agents on Apify. Search (Brave + DuckDuckGo), fetch pages to clean markdown, and multi-step research with relevance scoring and key fact extraction.

Search & Data Extraction
Arxiv Mcp Server

Search ArXiv research papers

Search & Data Extraction
Boikot

Model Context Protocol Server for looking up company ethics information. Learn about the ethical and unethical actions of major companies.

Search & Data Extraction
Brave Search Mcp Server

Web search capabilities using Brave's Search API

Search & Data Extraction
Activitypub Mcp

A comprehensive MCP server that enables LLMs to explore and interact with the Fediverse through ActivityPub protocol. Features WebFinger discovery, timeline fetching, instance exploration, and cross-platform support for Mastodon, Pleroma, Misskey, and other ActivityPub servers.

Search & Data Extraction
Gopher Mcp

Modern, cross-platform MCP server enabling AI assistants to browse and interact with both Gopher protocol and Gemini protocol resources safely and efficiently. Features dual protocol support, TLS security, and structured content extraction.

Search & Data Extraction
Unsplash Mcp

Unsplash photo search with proper attribution. Returns ready-to-use attribution text and HTML for each photo, making it easy for LLMs to build content pages with properly credited images. Includes search, random photos, and download tracking.

Search & Data Extraction
Mcp Page Capture

MCP server that captures webpage screenshots, with viewport or full-page options and base64 PNG output.

Search & Data Extraction
Openai Websearch Mcp

This is a Python-based MCP server that provides OpenAI `web_search` built-in tool.

Search & Data Extraction
Crawleo MCP

– Crawleo Search & Crawl API

Search & Data Extraction
Kagi Ken Mcp

Work with Kagi *without* API access (you'll need to be a customer, tho). Searches and summarizes. Uses Kagi session token for easy authentication.

Search & Data Extraction
Dappier Mcp

Enable fast, free real-time web search and access premium data from trusted media brandsβ€”news, financial markets, sports, entertainment, weather, and more. Build powerful AI agents with Dappier.

Search & Data Extraction
Mcp Opennutrition

Local MCP server for searching 300,000+ foods, nutrition facts, and barcodes from the OpenNutrition database.

Search & Data Extraction
Trieve

Crawl, embed, chunk, search, and retrieve information from datasets through [Trieve](https://trieve.ai)

Search & Data Extraction
Domain Search Mcp

Fast domain availability aggregator with pricing. Checks Porkbun, Namecheap, GoDaddy, RDAP & WHOIS. Includes bulk search, registrar comparison, AI-powered suggestions, and social media handle checking.

Search & Data Extraction
Gsc Mcp

MCP server for Google Search Console & Indexing API β€” 13 tools for search analytics, sitemaps, URL inspection, and batch indexing.

Search & Data Extraction
Domain Suite Mcp

Full domain lifecycle management: availability checking (zero config), registration, DNS, SSL, email auth (SPF/DKIM/DMARC), and WHOIS across Porkbun, Namecheap, GoDaddy, and Cloudflare. 21 tools.

Search & Data Extraction
Muumuu Domain Mcp

Official remote MCP server for Muumuu Domain (GMO Pepabo). Search and register domains, manage owned domains and contracts, and configure DNS records via natural language.

Search & Data Extraction
Mcp Server Dumplingai

Access data, web scraping, and document conversion APIs by [Dumpling AI](https://www.dumplingai.com/)

Search & Data Extraction
Open Sales Stack

Collection of B2B sales intelligence MCP servers. Includes website analysis, tech stack detection, hiring signals, review aggregation, ad tracking, social profiles, financial reporting and more for AI-powered prospecting by [Ekas](https://ekas.io/)

Search & Data Extraction
Melrose Mcp

Plays [Melrōse](https://melrōse.org) music expressions as MIDI

Search & Data Extraction
Decompose

Decompose text into classified semantic units with authority, risk, attention scores, and entity extraction. No LLM. Deterministic. Works as MCP server or CLI.

Search & Data Extraction
Mcp Hn

An MCP server to search Hacker News, get top stories, and more.

Search & Data Extraction
Jdl Mcp Server

Search 1M+ enriched job listings from 20,000+ companies. Filter by skills, salary, location, seniority, remote type, and more. Free β€” 500 calls/day, no signup required. Also available as a remote MCP server at `https://mcp.jobdatalake.com`.

Search & Data Extraction
Fiverr Mcp Server

Search Fiverr gigs, view seller profiles, compare pricing packages, and read reviews. No API key required.

Search & Data Extraction
Youtube Mcp

– MCP server that transcribes YouTube videos to text. Uses yt-dlp to download audio and OpenAI's Whisper-1 for more precise transcription than youtube captions. Provide a YouTube URL and get back the full transcript splitted by chunks for long videos.

Search & Data Extraction
Free Web Search Ultimate

Zero-cost, privacy-first universal web search MCP server. Enforces a **Search-First** paradigm β€” instructs LLMs to retrieve real-time information before answering factual questions. Supports 10+ search engines (DuckDuckGo, Bing, Google, Brave, Wikipedia, Arxiv, YouTube, Reddit) and deep page browsing. No API key required.

Search & Data Extraction
Multi Research Agents

a KTOR server/ MCP server written in Kotlin applying multi-agents schools in a flexible research system to be used with coding or for research any general case.

Search & Data Extraction
Giskard Search

Pay-per-use semantic web search for AI agents. Powered by SearxNG, agents pay in sats via Lightning Network micropayments β€” no API keys required. Self-hosted with phoenixd.

Search & Data Extraction
Agent Domain Service Mcp

AI-powered domain brainstorming, analysis, and availability checking via AgentDomainService.com. Generate creative domain names from descriptions, get AI scoring for brandability/memorability, and check real-time availability with pricing. No API keys required.

Search & Data Extraction
Hasdata Mcp

Remote MCP server providing structured data APIs for Google (Search, Maps, Trends, Flights), Amazon, Airbnb, Zillow, Yelp, and more. 40+ tools returns clean JSON data instead of browser automation or raw HTML scraping. Designed for AI agents requiring reliable hosted data access.

Search & Data Extraction
Mcp Paperswithcode

MCP to search through PapersWithCode API

Search & Data Extraction
Unsplash Mcp Server

) - A MCP server for Unsplash image search.

Search & Data Extraction
Himalayas Mcp

Access tens of thousands of remote job listings and company information. This public MCP server provides real-time access to Himalayas' remote jobs database.

Search & Data Extraction
Mcp Claude Hackernews

An integration that allows Claude Desktop to interact with Hacker News using the Model Context Protocol (MCP).

Search & Data Extraction
Mcp Domain Availability

A Model Context Protocol (MCP) server that enables Claude Desktop to check domain availability across 50+ TLDs. Features DNS/WHOIS verification, bulk checking, and smart suggestions. Zero-clone installation via uvx.

Search & Data Extraction
Mcp Rss Aggregator

Model Context Protocol Server for aggregating RSS feeds in Claude Desktop.

Search & Data Extraction
Rss Feeds Mcp

RSS feeds MCP server with 8 tools β€” fetch, filter, search, and manage RSS feeds by category or source. Zero config, no API keys required.

Search & Data Extraction
Mcp Ip2whois

MCP server that provides comprehensive WHOIS lookup capabilities using the IP2WHOIS API. This server allows AI agents to query domain registration details, including expiry dates, registrar information, and registrant data.

Search & Data Extraction
Naver Search Mcp

MCP server for Naver Search API integration, supporting blog, news, shopping search and DataLab analytics features.

Search & Data Extraction
Fetcher Mcp

MCP server for fetching web page content using Playwright headless browser, supporting Javascript rendering and intelligent content extraction, and outputting Markdown or HTML format.

Search & Data Extraction
G Search Mcp

A powerful MCP server for Google search that enables parallel searching with multiple keywords simultaneously.

Search & Data Extraction
Overseerr Mcp

Integrate AI assistants with Overseerr and the Seerr (the unified successor) for automated media discovery, requests, and management in Plex, Jellyfin, and Emby ecosystems.

Search & Data Extraction
Stocky

An MCP server for searching and downloading royalty-free stock photography from Pexels and Unsplash. Features multi-provider search, rich metadata, pagination support, and async performance for AI assistants to find and access high-quality images.

Search & Data Extraction
Anybrowse

Convert any URL to LLM-ready Markdown via real Chrome browsers. 3 tools: scrape, crawl, search. Free via MCP, pay-per-use via x402. Remote MCP endpoint: `https://anybrowse.dev/mcp`

Search & Data Extraction
Json Mcp Filter

– Stop bloating your LLM context. Query & Extract only what you need from your JSON files.

Search & Data Extraction
Web Analyzer MCP

Extracts clean web content for RAG and provides Q&A about web pages.

Search & Data Extraction
Mcp Tavily

– Tavily AI search API

Search & Data Extraction
Bing Search Mcp

Web search capabilities using Microsoft Bing Search API

Search & Data Extraction
Korean Data Mcp

Real-time Korean web data β€” Naver place reviews, Melon music chart, Daangn/Bunjang marketplace listings, Naver news, Musinsa fashion rankings. 7 tools powered by Apify actors. Requires APIFY_TOKEN.

Search & Data Extraction
Content Core

Extract content from URLs, documents, videos, and audio files using intelligent auto-engine selection. Supports web pages, PDFs, Word docs, YouTube transcripts, and more with structured JSON responses.

Search & Data Extraction
Linkedapi Mcp

MCP server that lets AI assistants control LinkedIn accounts and retrieve real-time data.

Search & Data Extraction
Mineru Mcp

MCP server for MinerU document parsing API. Parse PDFs, images, DOCX, and PPTX with OCR (109 languages), batch processing (200 docs), page ranges, and local file upload. 73% token reduction with structured output.

Search & Data Extraction
Brightdata Mcp

Discover, extract, and interact with the web - one interface powering automated access across the public internet.

Search & Data Extraction
Marketplace Search Mcp

Search marketplaces (TCGPlayer, Reverb, Thumbtack), verify professional licenses (contractor, nurse across US states), and look up PSA card grading population data.

Search & Data Extraction
Brave Search Mcp

Web, Image, News, Video, and Local Point of Interest search capabilities using Brave's Search API

Search & Data Extraction
Nab

Ultra-fast web fetcher and MCP server with HTTP/3, JS rendering, anti-fingerprinting, browser cookie auth, and 1Password integration. Fetches any URL as clean Markdown for AI context.

Search & Data Extraction
Server Fetch

Efficient web content fetching and processing for AI consumption

Search & Data Extraction
Mcp Webresearch

Search Google and do deep web research on any topic

Search & Data Extraction
Newsmcp

Real-time world news for AI agents β€” events clustered from hundreds of sources, classified by topic and geography, ranked by importance. Free, no API key. `npx -y @newsmcp/server`

Search & Data Extractionnpx -y @newsmcp/server
Wet Mcp

Web search (embedded SearXNG), content extraction, and library docs indexing with hybrid search (FTS5 + semantic). Built-in Qwen3 embedding, no API keys required.

Search & Data Extraction
Duckduckgo Mcp Server

Web search using DuckDuckGo

Search & Data Extraction
Mcp Local Rag

"primitive" RAG-like web search model context protocol (MCP) server that runs locally. No APIs needed.

Search & Data Extraction
NyxDocs

Specialized MCP server for cryptocurrency project documentation management with multi-blockchain support (Ethereum, BSC, Polygon, Solana).

Search & Data Extraction
Scout Intel Mcp

Web intelligence MCP server for AI agents. 7 tools for SERP analysis, competitor research, market trends, content gap analysis, keyword insights, audience discovery, and citation tracking. Install via `pip install scout-intel-mcp`.

Search & Data Extractionpip install scout-intel-mcp
MinerU Ecosystem

Official MinerU document parsing MCP ([mineru-open-mcp](https://pypi.org/project/mineru-open-mcp/) on PyPI). Converts PDFs, doc/docx/ppt/pptx, images, and spreadsheets to Markdown via the [MinerU](https://mineru.net) API; free Flash mode without an API key (about 20 pages per file); optional `MINERU_API_TOKEN` for higher limits.

Search & Data Extraction
Pdfmux

PDF extraction router with built-in MCP server. Classifies each page (digital, scanned, tables) and routes to the best backend (PyMuPDF, Docling, OCR, or optional LLM fallback). Per-page confidence scoring flags low-quality pages and auto-reextracts them β€” prevents silent RAG failures. Zero config: `pip install pdfmux`. MIT licensed.

Search & Data Extractionpip install pdfmux
Search Mcp

Highest Accuracy Web Search for AI

Search & Data Extraction
Spectrawl

Unified web layer for AI agents. Search (8 engines), stealth browse, cookie auth, and act on 24 platforms. 5,000 free searches/month via Gemini Grounded Search.

Search & Data Extraction
Task Mcp

Highest Accuracy Deep Research and Batch Tasks MCP

Search & Data Extraction
Govuk Mcp

Search GOV.UK content, retrieve full government pages, look up organisations, and resolve UK postcodes to local authorities. 5 read-only tools, no API keys required.

Search & Data Extraction
Semanticapi Mcp

Natural language API discovery β€” search 700+ API capabilities, get endpoints, auth setup, and code snippets. Supports auto-discovery of new APIs.

Search & Data Extraction
Mcp Server Webcrawl

Advanced search and retrieval for web crawler data. Supports WARC, wget, Katana, SiteOne, and InterroBot crawlers.

Search & Data Extraction
Ocds Mcp

German public procurement data (OCDS) β€” semantic search, tender matching with company profiles, and structured filtering.

Search & Data Extraction
Catalysishub Mcp Server

Unofficial MCP server for searching and retrieving scientific data from the Catalysis Hub database, providing access to computational catalysis research and surface reaction data.

Search & Data Extraction
Opentk Mcp

Access Dutch Parliament (Tweede Kamer) information including documents, debates, activities, and legislative cases through structured search capabilities (based on opentk project by Bert Hubert)

Search & Data Extraction
Mcp Server Deep Research

MCP server providing OpenAI/Perplexity-like autonomous deep research, structured query elaboration, and concise reporting.

Search & Data Extraction
Mcp Wolframalpha

An MCP server lets AI assistants use the Wolfram Alpha API for real-time access to computational knowledge and data.

Search & Data Extraction
Partle Mcp

Search products and stores in nearby physical stores. Find what you need locally instead of waiting for delivery. Remote MCP server (Streamable HTTP, no API key required).

Search & Data Extraction
Scout Mcp

Multi-source search across code registries (GitHub, npm, PyPI), academic indexes (arXiv, Semantic Scholar), social platforms (HN, Reddit, X), and community blogs (Dev.to, Hashnode, Qiita, Zenn). Parallel fetch with structured JSON output. `npx -y scout-cli`.

Search & Data Extractionnpx -y scout-cli
Scraperapi Mcp

MCP server for ScraperAPI web scraping with JavaScript rendering, geotargeting, premium proxies, and auto-parsing support.

Search & Data Extraction
Scrapercity Cli

B2B lead generation with 20+ tools including Apollo, Google Maps, email finder, email validator, mobile finder, skip trace, and ecommerce store data.

Search & Data Extraction
Searchcraft Mcp Server

Official MCP server for managing Searchcraft clusters, creating a search index, generating an index dynamically given a data file and for easily importing data into a search index given a feed or local json file.

Search & Data Extraction
MCP Searxng

An MCP Server to connect to searXNG instances

Search & Data Extraction
Opengraph Io Mcp

OpenGraph.io API integration for extracting OG metadata, taking screenshots, scraping web content, querying sites with AI, and generating branded images (illustrations, diagrams, social cards, icons, QR codes) with iterative refinement.

Search & Data Extraction
Serpapi Mcp

SerpApi MCP Server for Google and other search engine results. Provides multi-engine search across Google, Bing, Yahoo, DuckDuckGo, YouTube, eBay, and more with real-time weather data, stock market information, and flexible JSON response modes.

Search & Data Extraction
Sifter

Structure any document, query it like a database. Open-source extraction engine that turns any document into typed, schema-defined records, queryable in natural language from Claude, ChatGPT, Gemini, or any MCP client.

Search & Data Extraction
Linkmeta Api

Free URL metadata extraction API (Open Graph, Twitter Cards, favicons, JSON-LD). No API key required.

Search & Data Extraction
Rescuedogs Mcp Server

Search and discover rescue dogs from European and UK organizations with AI-powered personality matching and detailed profiles.

Search & Data Extraction
Mcp Server

Convert any URL to clean, token-efficient Markdown for AI agents. API-backed extraction with token counting, CSS selector support, and configurable caching via [StripFeed](https://www.stripfeed.dev).

Search & Data Extraction
Arxiv Latex Mcp

Get the LaTeX source of arXiv papers to handle mathematical content and equations

Search & Data Extraction
Talonic Mcp

Schema-validated document extraction with searchable workspace memory. Extract structured fields from PDFs, scans, images, and forms; AI agents can also search, filter, and query past extractions.

Search & Data Extraction
GeekNews MCP Server

An MCP Server that retrieves and processes news data from the GeekNews site.

Search & Data Extraction
The Data Collector

MCP server for scraping Hacker News, Bluesky, and Substack with x402 micropayment support. Tools: hn_search, bluesky_search, substack_search. $0.05/call via USDC on Base.

Search & Data Extraction
Tat Mcp Server

Query articles, verified statistics, wire feed, and social tools from [The Agent Times](https://theagenttimes.com), the AI-native newspaper covering the agent economy. 13 tools including search, comments, citations, and agent leaderboards. No API key required.

Search & Data Extraction
Enrichr Mcp Server

A MCP server that provides gene set enrichment analysis using the Enrichr API

Search & Data Extraction
Mcp Server Tavily

– Tavily AI search API

Search & Data Extraction
Urlbox Mcp Server

A reliable MCP server for generating and managing screenshots, PDFs, and videos, performing AI-powered screenshot analysis, and extracting web content (Markdown, metadata, and HTML) via the [Urlbox](https://urlbox.com) API.

Search & Data Extraction
Ncbi Mcp Server

Comprehensive NCBI/PubMed literature search server with advanced analytics, caching, MeSH integration, related articles discovery, and batch processing for all life sciences and biomedical research.

Search & Data Extraction
Web Search Plus Mcp

Multi-provider web search with intelligent auto-routing (Serper, Tavily, Exa). Available via `uvx web-search-plus-mcp`.

Search & Data Extractionuvx web-search-plus-mcp
Agent Toolbox

Production-ready MCP server providing 13 tools for AI agents: web search, content extraction, screenshots, weather, finance, email validation, translation, news, GeoIP, WHOIS, DNS, PDF extraction, and QR code generation. 1,000 free calls/month, no setup required.

Search & Data Extraction
Webpeel

Smart web fetcher for AI agents with auto-escalation from HTTP to headless browser to stealth mode. Includes 9 MCP tools: fetch, search, crawl, map, extract, batch, screenshot, jobs, and agent. Achieved 100% success rate on a 30-URL benchmark.

Search & Data Extraction
Baseline Mcp Server

MCP server that searches Baseline status using Web Platform API

Search & Data Extraction
Duckduckgo Mcp Server

This is a TypeScript-based MCP server that provides DuckDuckGo search functionality.

Search & Data Extraction
Google Researcher Mcp

Comprehensive research tools including Google Search (web, news, images), web scraping with JavaScript rendering, academic paper search (arXiv, PubMed, IEEE), patent search, and YouTube transcript extraction.

Search & Data Extraction
Youtube Summarize

MCP server that fetches YouTube video transcripts and optionally summarizes them. Supports multiple transcript formats (text, JSON, SRT, WebVTT), multi-language retrieval, and flexible YouTube URL parsing.

Search & Data Extraction
Mcp Zoomeye

Querying network asset information by ZoomEye MCP Server

Search & Data Extraction