✓ Verified 💻 Development ✓ Enhanced Data

Markitdown Skill

OpenClaw agent skill for converting documents to Markdown.

Rating
5 (376 reviews)
Downloads
13,108 downloads
Version
1.0.0

Overview

OpenClaw agent skill for converting documents to Markdown.

Complete Documentation

View Source →

MarkItDown Skill

Documentation and utilities for converting documents to Markdown using Microsoft's MarkItDown library.

Note: This skill provides documentation and a batch script. The actual conversion is done by the markitdown CLI/library installed via pip.

When to Use

Use markitdown for:

  • 📄 Fetching documentation (README, API docs)
  • 🌐 Converting web pages to markdown
  • 📝 Document analysis (PDFs, Word, PowerPoint)
  • 🎬 YouTube transcripts
  • 🖼️ Image text extraction (OCR)
  • 🎤 Audio transcription

Quick Start

bash
# Convert file to markdown
markitdown document.pdf -o output.md

# Convert URL
markitdown https://example.com/docs -o docs.md

Supported Formats

FormatFeatures
PDFText extraction, structure
Word (.docx)Headings, lists, tables
PowerPointSlides, text
ExcelTables, sheets
ImagesOCR + EXIF metadata
AudioSpeech transcription
HTMLStructure preservation
YouTubeVideo transcription

Installation

The skill requires Microsoft's markitdown CLI:

bash
pip install 'markitdown[all]'

Or install specific formats only:

bash
pip install 'markitdown[pdf,docx,pptx]'

Common Patterns

Fetch Documentation

bash
markitdown https://github.com/user/repo/blob/main/README.md -o readme.md

Convert PDF

bash
markitdown document.pdf -o document.md

Batch Convert

bash
# Using included script
python ~/.openclaw/skills/markitdown/scripts/batch_convert.py docs/*.pdf -o markdown/ -v

# Or shell loop
for file in docs/*.pdf; do
  markitdown "$file" -o "${file%.pdf}.md"
done

Python API

python
from markitdown import MarkItDown

md = MarkItDown()
result = md.convert("document.pdf")
print(result.text_content)

Troubleshooting

"markitdown not found"

bash
pip install 'markitdown[all]'

OCR Not Working

bash
# Ubuntu/Debian
sudo apt-get install tesseract-ocr

# macOS
brew install tesseract

What This Skill Provides

ComponentSource
markitdown CLIMicrosoft's pip package
markitdown Python APIMicrosoft's pip package
scripts/batch_convert.pyThis skill (utility)
DocumentationThis skill

See Also

Installation

Terminal bash

openclaw install markitdown-skill
    
Copied!

💻Code Examples

markitdown https://example.com/docs -o docs.md

markitdown-httpsexamplecomdocs--o-docsmd.txt
## Supported Formats

| Format | Features |
|--------|----------|
| PDF | Text extraction, structure |
| Word (.docx) | Headings, lists, tables |
| PowerPoint | Slides, text |
| Excel | Tables, sheets |
| Images | OCR + EXIF metadata |
| Audio | Speech transcription |
| HTML | Structure preservation |
| YouTube | Video transcription |

## Installation

The skill requires Microsoft's `markitdown` CLI:

pip install 'markitdown[pdf,docx,pptx]'

pip-install-markitdownpdfdocxpptx.txt
## Common Patterns

### Fetch Documentation

print(result.text_content)

printresulttextcontent.txt
## Troubleshooting

### "markitdown not found"
example.sh
# Convert file to markdown
markitdown document.pdf -o output.md

# Convert URL
markitdown https://example.com/docs -o docs.md
example.sh
# Using included script
python ~/.openclaw/skills/markitdown/scripts/batch_convert.py docs/*.pdf -o markdown/ -v

# Or shell loop
for file in docs/*.pdf; do
  markitdown "$file" -o "${file%.pdf}.md"
done
example.py
from markitdown import MarkItDown

md = MarkItDown()
result = md.convert("document.pdf")
print(result.text_content)
example.sh
# Ubuntu/Debian
sudo apt-get install tesseract-ocr

# macOS
brew install tesseract

Tags

#web_and-frontend-development

Quick Info

Category Development
Model Claude 3.5
Complexity Multi-Agent
Author karmanverma
Last Updated 3/10/2026
🚀
Optimized for
Claude 3.5
🧠

Ready to Install?

Get started with this skill in seconds

openclaw install markitdown-skill