MarkItDown

CLI installation guide

Install baseline for all IDEs before listing-specific setup.

Using servers guide

Covers runtime usage patterns and auth flow.

learn

Cursor setup walkthrough

High-intent setup path for developer troubleshooting journeys.

Troubleshooting: server not working

Common failure modes for install and runtime issues.

Troubleshooting: tools not showing

Covers discovery/listing failures across major IDEs.

Related explore entry: Memory

Keeps same-intent users on matched category and tool shape.

Related explore entry: Roundtable MCP

Keeps same-intent users on matched category and tool shape.

Related explore entry: Sequential Thinking

Keeps same-intent users on matched category and tool shape.

Validate in Playground Publish Your Server Ecosystem Overview

Related explore entry: Everything MCP Server

Keeps same-intent users on matched category and tool shape.

/// ONE-COMMAND INSTALL

Install this server instantly with the AgenticMarket CLI — zero config, auto-detects your IDE.

$npx agenticmarket install markitdown

BROWSE ALL SERVERS →

TAGGED:open-source markdown pdf openai autogen

/// RELATED SERVERS IN THE ECOSYSTEM

ECOSYSTEM9 tools

Memory

Give your AI assistant persistent memory across conversations. The Memory server stores entities, relations, and observations in a local knowledge graph that persists between sessions.

ECOSYSTEM13 tools

Roundtable MCP

Multi-model AI council MCP server that enables collaborative reasoning for architecture, debugging, code review, and engineering decisions.

ECOSYSTEM1 tools

Sequential Thinking

Enhance your AI assistant's reasoning with structured, step-by-step thinking. Supports revisions, branching, and dynamic adjustment of reasoning depth.

ECOSYSTEM15 tools

Everything MCP Server

The official MCP reference server that exercises every protocol feature — prompts, tools, resources, sampling, and all transports. Built for MCP client developers and testing.

MarkItDown

ECOSYSTEM REFERENCENO AUTHMITOpen Source

Source: @agenticmarket· community reference

MarkItDown now offers an MCP (Model Context Protocol) server for integration with LLM applications like Claude Desktop, VS code, Codex and others

Source@agenticmarket

Tools

Updated

04/16/2026

Setup Guide

~/.cursor/mcp.json

{
  "mcpServers": {
    "markitdown": {
      "command": "uvx",
      "args": [
        "markitdown-mcp@0.0.1a4"
      ]
    }
  }
}

Tools

TOOLS EXPOSED1 TOOL

convert_to_markdown

Compatibility

IDE COMPATIBILITY

✗Antigravity

Not supported

✓VS Code

Tested

✓Codex

Tested

?Gemini CLI

Not verified

✓GitHub Copilot

Tested

Last tested: April 16, 2026

About

MarkItDown

PyPI - Downloads

[!TIP] MarkItDown now offers an MCP (Model Context Protocol) server for integration with LLM applications like Claude Desktop. See markitdown-mcp for more information.

[!IMPORTANT] Breaking changes between 0.0.1 to 0.1.0:

Dependencies are now organized into optional feature-groups (further details below). Use pip install 'markitdown[all]' to have backward-compatible behavior.

convert_stream() now requires a binary file-like object (e.g., a file opened in binary mode, or an io.BytesIO object). This is a breaking change from the previous version, where it previously also accepted text file-like objects, like io.StringIO.

The DocumentConverter class interface has changed to read from file-like streams rather than file paths. No temporary files are created anymore. If you are the maintainer of a plugin, or custom DocumentConverter, you likely need to update your code. Otherwise, if only using the MarkItDown class or CLI (as in these examples), you should not need to change anything.

MarkItDown currently supports the conversion from:

PDF
PowerPoint
Word
Excel
Images (EXIF metadata and OCR)
Audio (EXIF metadata and speech transcription)
HTML
Text-based formats (CSV, JSON, XML)
ZIP files (iterates over contents)
Youtube URLs
EPubs
... and more!

Why Markdown?

Prerequisites

MarkItDown requires Python 3.10 or higher. It is recommended to use a virtual environment to avoid dependency conflicts.

With the standard Python installation, you can create and activate a virtual environment using the following commands:

bash
python -m venv .venv
source .venv/bin/activate

If using uv, you can create a virtual environment with:

bash
uv venv --python=3.12 .venv
source .venv/bin/activate
# NOTE: Be sure to use 'uv pip install' rather than just 'pip install' to install packages in this virtual environment

If you are using Anaconda, you can create a virtual environment with:

bash
conda create -n markitdown python=3.12
conda activate markitdown

Installation

To install MarkItDown, use pip: pip install 'markitdown[all]'. Alternatively, you can install it from the source:

bash
git clone git@github.com:microsoft/markitdown.git
cd markitdown
pip install -e 'packages/markitdown[all]'

Usage

Command-Line

bash
markitdown path-to-file.pdf > document.md

Or use -o to specify the output file:

bash
markitdown path-to-file.pdf -o document.md

You can also pipe content:

bash
cat path-to-file.pdf | markitdown

Optional Dependencies

bash
pip install 'markitdown[pdf, docx, pptx]'

will install only the dependencies for PDF, DOCX, and PPTX files.

At the moment, the following optional dependencies are available:

[all] Installs all optional dependencies
[pptx] Installs dependencies for PowerPoint files
[docx] Installs dependencies for Word files
[xlsx] Installs dependencies for Excel files
[xls] Installs dependencies for older Excel files
[pdf] Installs dependencies for PDF files
[outlook] Installs dependencies for Outlook messages
[az-doc-intel] Installs dependencies for Azure Document Intelligence
[audio-transcription] Installs dependencies for audio transcription of wav and mp3 files
[youtube-transcription] Installs dependencies for fetching YouTube video transcription

Plugins

MarkItDown also supports 3rd-party plugins. Plugins are disabled by default. To list installed plugins:

bash
markitdown --list-plugins

To enable plugins use:

bash
markitdown --use-plugins path-to-file.pdf

To find available plugins, search GitHub for the hashtag #markitdown-plugin. To develop a plugin, see packages/markitdown-sample-plugin.

markitdown-ocr Plugin

Installation:

bash
pip install markitdown-ocr
pip install openai  # or any OpenAI-compatible client

Usage:

Pass the same llm_client and llm_model you would use for image descriptions:

python
from markitdown import MarkItDown
from openai import OpenAI

md = MarkItDown(
    enable_plugins=True,
    llm_client=OpenAI(),
    llm_model="gpt-4o",
)
result = md.convert("document_with_images.pdf")
print(result.text_content)

If no llm_client is provided the plugin still loads, but OCR is silently skipped and the standard built-in converter is used instead.

See packages/markitdown-ocr/README.md for detailed documentation.

Azure Document Intelligence

To use Microsoft Document Intelligence for conversion:

bash
markitdown path-to-file.pdf -o document.md -d -e "<document_intelligence_endpoint>"

More information about how to set up an Azure Document Intelligence Resource can be found here

Python API

Basic usage in Python:

python
from markitdown import MarkItDown

md = MarkItDown(enable_plugins=False) # Set to True to enable plugins
result = md.convert("test.xlsx")
print(result.text_content)

Document Intelligence conversion in Python:

python
from markitdown import MarkItDown

md = MarkItDown(docintel_endpoint="<document_intelligence_endpoint>")
result = md.convert("test.pdf")
print(result.text_content)

To use Large Language Models for image descriptions (currently only for pptx and image files), provide llm_client and llm_model:

python
from markitdown import MarkItDown
from openai import OpenAI

client = OpenAI()
md = MarkItDown(llm_client=client, llm_model="gpt-4o", llm_prompt="optional custom prompt")
result = md.convert("example.jpg")
print(result.text_content)

Docker

sh
docker build -t markitdown:latest .
docker run --rm -i markitdown:latest < ~/your-file.pdf > output.md

Install and Troubleshooting Intent Coverage

Developer-install and troubleshooting intent for community MCP server listings.

install

Owner: /explore/{slug}

install mcp server / mcp server setup guide

setup

Owner: /explore/{slug}

mcp json config example / vscode mcp setup

errors

Owner: /explore/{slug}

mcp server not working / mcp tools not showing

compatibility

Owner: /explore/{slug}

mcp server compatibility matrix / cursor vs vscode mcp compatibility

monetization-discovery

Owner: /explore/{slug}

mcp server monetization options / convert community mcp server to paid listing

Related Setup, Debug, and Learning Links

CLI installation guide

Install baseline for all IDEs before listing-specific setup.

Using servers guide

Covers runtime usage patterns and auth flow.

learn

Cursor setup walkthrough

High-intent setup path for developer troubleshooting journeys.

Troubleshooting: server not working

Common failure modes for install and runtime issues.

Troubleshooting: tools not showing

Covers discovery/listing failures across major IDEs.

Related explore entry: Memory

Keeps same-intent users on matched category and tool shape.

Related explore entry: Roundtable MCP

Keeps same-intent users on matched category and tool shape.

Related explore entry: Sequential Thinking

Keeps same-intent users on matched category and tool shape.