Convert MCP API

Document OCR, text extraction, and audio transcription via Model Context Protocol

Overview

The Convert MCP API provides AI assistants with document OCR, text extraction, and audio transcription capabilities. Extract text from PDFs, images, Office documents, and transcribe audio files using advanced OCR and speech recognition technology with support for 60+ languages.

MCP Endpoint

https://mcp.ainoflow.io/mcp/v1/convert

Transport:

HTTP (SSE)

Configuration

OpenAI Agent Builder

Add MCP server in the UI dialog:

URL:

Label:

Authentication (Access token / API Key):

Claude Desktop

Add to your Claude Desktop configuration:

Cursor IDE

Add to your Cursor MCP configuration:

Supported File Formats

Images (OCR)

PDF, JPEG, PNG, TIFF, BMP, WebP, GIF

Documents

Word (.doc, .docx), RTF, ODT, TXT

Spreadsheets

Excel (.xls, .xlsx), ODS

Presentations

PowerPoint (.ppt, .pptx), ODP

Audio (Transcription)

WAV, MP3, M4A, MP4, WebM, OGG, FLAC, AAC, Opus

Maximum file size: 100MB

Available Tools

submit_file_url

Submit file from external URL for OCR processing or audio transcription

Parameters

sourceUrl

required

HTTP/HTTPS URL to download the file

languages

optional

Comma-separated ISO codes (default: "en"). Examples: "en", "en,de,fr", "zh-cn,ja"

outputs

optional

Output formats: "text", "pdf", or "text,pdf" (default: "text"). PDF not available for audio.

Example - Document OCR

Example - Audio Transcription

Available Resources

convert://jobs/{id}

Job status with download URLs

Returns: JSON with job status and pre-signed URLs

convert://jobs/{id}/text

Plain text content or audio transcription

Returns: Extracted text directly (not URL)

convert://jobs/{id}/pdf

PDF with searchable text (not available for audio)

Returns: Base64-encoded PDF blob (not URL)

Default Values

MCP uses smart defaults for simplified usage

languages"en" (English)

outputs"text" (plain text)

models"auto" (PaddleOCR/Tesseract/Whisper)

For most use cases, only specify sourceUrl. Defaults handle English text extraction and audio transcription perfectly.

Quick Example

Ready to integrate Convert MCP?

Get Started Free Back to MCP Hub