
Convert MCP API

Document OCR, text extraction, and audio transcription via Model Context Protocol

Overview

The Convert MCP API gives AI assistants document OCR, text extraction, and audio transcription capabilities. It extracts text from PDFs, images, and Office documents and transcribes audio files, using OCR and speech recognition with support for 60+ languages.

MCP Endpoint

https://mcp.ainoflow.io/mcp/v1/convert

Transport: HTTP (SSE)

Configuration

OpenAI Agent Builder

Add the MCP server in the UI dialog, filling in these fields:

URL
Label
Authentication (Access token / API Key)

Claude Desktop

Add to your Claude Desktop configuration:

Cursor IDE

Add to your Cursor MCP configuration:

Supported File Formats

Images (OCR)

PDF, JPEG, PNG, TIFF, BMP, WebP, GIF

Documents

Word (.doc, .docx), RTF, ODT, TXT

Spreadsheets

Excel (.xls, .xlsx), ODS

Presentations

PowerPoint (.ppt, .pptx), ODP

Audio (Transcription)

WAV, MP3, M4A, MP4, WebM, OGG, FLAC, AAC, Opus

Maximum file size: 100 MB

Available Tools

submit_file_url
Submit file from external URL for OCR processing or audio transcription

Parameters

sourceUrl (required)
HTTP or HTTPS URL from which the file is downloaded

languages (optional)
Comma-separated ISO language codes (default: "en"). Examples: "en", "en,de,fr", "zh-cn,ja"

outputs (optional)
Output formats: "text", "pdf", or "text,pdf" (default: "text"). PDF output is not available for audio.
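
The parameter rules above can be checked on the client side before a file is submitted. The following is a minimal sketch, not part of the Convert MCP API: the function names check_arguments and split_csv and the example URL are hypothetical, and the server may apply stricter or looser validation than shown here.

```python
ALLOWED_OUTPUTS = {"text", "pdf"}


def split_csv(value):
    """Split a comma-separated string into trimmed, non-empty items."""
    return [item.strip() for item in value.split(",") if item.strip()]


def check_arguments(source_url, languages="en", outputs="text"):
    """Validate submit_file_url arguments against the documented rules.

    Hypothetical helper: it only mirrors the parameter descriptions above.
    """
    if not source_url.lower().startswith(("http://", "https://")):
        raise ValueError("sourceUrl must be an HTTP or HTTPS URL")

    language_codes = split_csv(languages)
    if not language_codes:
        raise ValueError("languages must contain at least one code")

    output_formats = split_csv(outputs)
    if not output_formats or set(output_formats) - ALLOWED_OUTPUTS:
        raise ValueError(f"outputs must use only 'text' and 'pdf'; got {outputs!r}")

    # Re-join the lists so stray spaces such as "en, de" are normalized.
    return {
        "sourceUrl": source_url,
        "languages": ",".join(language_codes),
        "outputs": ",".join(output_formats),
    }


# Placeholder URL; replace with a real, publicly reachable file.
print(check_arguments("https://example.com/report.pdf", "en, de", "text,pdf"))
```

Omitting languages and outputs falls back to the documented defaults ("en" and "text").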

Example - Document OCR
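
MCP clients normally build the underlying message for you, but it can help to see what a document OCR call might look like on the wire. The sketch below assembles a JSON-RPC 2.0 tools/call request as defined by the Model Context Protocol. The helper name build_submit_request, the file URL, and the request id are placeholders chosen for illustration; only the tool name and its three parameters come from this page.

```python
import json


def build_submit_request(source_url, languages="en", outputs="text", request_id=1):
    """Build a JSON-RPC 2.0 tools/call payload for the submit_file_url tool."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {
            "name": "submit_file_url",
            "arguments": {
                "sourceUrl": source_url,
                "languages": languages,
                "outputs": outputs,
            },
        },
    }


# Hypothetical scanned document in English and German; request both
# plain text and a searchable PDF.
ocr_request = build_submit_request(
    "https://example.com/scans/invoice.pdf",
    languages="en,de",
    outputs="text,pdf",
)
print(json.dumps(ocr_request, indent=2))
```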

Example - Audio Transcription
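
For audio, only text output is available, so a transcription call passes outputs as "text" or leaves it at the default. The sketch below prepares the arguments object for a submit_file_url call on an audio file. The helper audio_arguments, the extension list (derived from the audio formats listed on this page, assuming conventional file extensions), and the example URL are all assumptions; the service may identify audio by content rather than by extension.

```python
from pathlib import PurePosixPath
from urllib.parse import urlparse

# Assumed extensions for the audio formats listed under Supported File Formats.
AUDIO_EXTENSIONS = {
    ".wav", ".mp3", ".m4a", ".mp4", ".webm", ".ogg", ".flac", ".aac", ".opus",
}


def audio_arguments(source_url, languages="en"):
    """Return submit_file_url arguments for an audio transcription job."""
    # Look only at the URL path so query strings do not hide the extension.
    suffix = PurePosixPath(urlparse(source_url).path).suffix.lower()
    if suffix not in AUDIO_EXTENSIONS:
        raise ValueError(f"{suffix or 'missing extension'} is not a listed audio format")
    # PDF output is not available for audio, so always request text.
    return {"sourceUrl": source_url, "languages": languages, "outputs": "text"}


# Hypothetical recording URL used for illustration only.
print(audio_arguments("https://example.com/media/interview.mp3?download=1"))
```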

Available Resources

convert://jobs/{id}
Job status with download URLs

Returns: JSON with job status and pre-signed URLs

convert://jobs/{id}/text
Plain text content or audio transcription

Returns: Extracted text directly (not URL)

convert://jobs/{id}/pdf
PDF with searchable text (not available for audio)

Returns: Base64-encoded PDF blob (not URL)
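
The three resources above could be read with standard MCP resources/read requests. The sketch below builds the resource URIs and unpacks a result, assuming the response follows the MCP convention of a contents list whose items carry either a text field or a base64 blob field. The job ID "abc123", the helper names, and the sample response are invented for illustration and are not real service output.

```python
import base64


def job_uri(job_id, part=None):
    """Return convert://jobs/{id}, or convert://jobs/{id}/{part} for text or pdf."""
    base = f"convert://jobs/{job_id}"
    return f"{base}/{part}" if part else base


def read_request(uri, request_id=1):
    """Build a JSON-RPC 2.0 resources/read payload for the given URI."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "resources/read",
        "params": {"uri": uri},
    }


def extract_content(result):
    """Return bytes for a blob resource, or str for a text resource."""
    item = result["contents"][0]
    if "blob" in item:
        return base64.b64decode(item["blob"])
    return item["text"]


# Made-up PDF response, shaped like an MCP resources/read result.
sample_result = {
    "contents": [
        {
            "uri": job_uri("abc123", "pdf"),
            "mimeType": "application/pdf",
            "blob": base64.b64encode(b"%PDF-1.7 sample").decode("ascii"),
        }
    ]
}

print(read_request(job_uri("abc123", "text")))
print(extract_content(sample_result)[:8])
```

Decoding the blob yields raw PDF bytes that can be written to disk with a binary file handle.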

Default Values
MCP uses smart defaults for simplified usage
languages: "en" (English)
outputs: "text" (plain text)
models: "auto" (PaddleOCR/Tesseract/Whisper)

For most use cases, specify only sourceUrl. The defaults cover English text extraction and audio transcription.

Quick Example
