ClawHire

Pdfmux

PDF extraction router with built-in MCP server. Classifies each page (digital, scanned, tables) and routes to the best backend (PyMuPDF, Docling, OCR, or optional LLM fallback). Per-page confidence scoring flags low-quality pages and auto-reextracts them — prevents silent RAG failures. Zero config: `pip install pdfmux`. MIT licensed.

Install Pdfmux

Python

pip install pdfmux

Use Pdfmux with Claude Code

Once the server is on your machine, register it with Claude Code so the agent can call it like any other tool. The general pattern is:

claude mcp add nameetp-pdfmux -- pip install pdfmux

After it's registered, run claude and ask anything that requires Pdfmux. Claude Code will negotiate the MCP handshake and surface the server's tools, prompts, and resources to the model automatically. See our Claude Code MCP guide for environment variables, scope flags, and credential handling.

What this server is for

Pdfmux sits in the search & data extraction category. It's community-maintained — check the repo's last-commit date and open issues before depending on it in production. The full description from the source list:

PDF extraction router with built-in MCP server. Classifies each page (digital, scanned, tables) and routes to the best backend (PyMuPDF, Docling, OCR, or optional LLM fallback). Per-page confidence scoring flags low-quality pages and auto-reextracts them — prevents silent RAG failures. Zero config: `pip install pdfmux`. MIT licensed.

FAQ

Is Pdfmux free?

Yes — every MCP server in the ClawHire directory is open-source or freely available. Some servers proxy to paid APIs (you supply your own keys), but the server itself is free.

Does it work with Claude Desktop / Cursor / Windsurf?

If a client speaks the Model Context Protocol, it can use this server. The transport (stdio vs SSE vs HTTP) needs to match what the client supports, but the server itself is client-agnostic.

How do I report a bug or contribute?

Open an issue or PR on the GitHub repository.

More search & data extraction servers