docsmith-mcp

![npm version](https://www.npmjs.com/package/docsmith-mcp)

Python-powered document processing MCP with MCP Apps — Process Excel, Word, PDF, PowerPoint documents with ease using Python, and view them beautifully through an interactive MCP App.

Features

- Excel: Read/write .xlsx files with sheet support and pagination
- Word: Read/write .docx files with paragraph and table support
- PDF: Read .pdf files with text extraction and pagination
- PowerPoint: Read .pptx files with slide content extraction
- Text Files: Read/write .txt, .csv, .md, .json, .yaml, .yml with pagination support
- Run Python: Execute Python code for flexible file operations and data processing
- MCP App: Beautiful React + Tailwind CSS app for viewing all document types
- Flexible Reading Modes: Raw full read or paginated for large files
- Powered by Pyodide: Runs in secure WebAssembly sandbox via code-runner-mcp

Quick Start

$3

Add to your MCP client configuration (e.g., Claude Desktop, Cline, etc.):

Via npx (recommended):
``json { "mcpServers": { "docsmith": { "command": "npx", "args": ["-y", "docsmith-mcp"], "env": { "DOC_PAGE_SIZE": "100" } } } }`

Via global installation:`bash npm install -g docsmith-mcp`

`json { "mcpServers": { "docsmith": { "command": "docsmith-mcp", "env": { "DOC_PAGE_SIZE": "100" } } } }`

Via local path:`json { "mcpServers": { "docsmith": { "command": "node", "args": ["/path/to/docsmith-mcp/dist/index.js"] } } }`

Then use the read_document tool:

`json { "file_path": "/path/to/document.xlsx", "mode": "paginated", "page": 1, "page_size": 50 }`

The MCP App will automatically open to display the document content beautifully.

`Supported Formats`

| Format | Extensions | Read | Write | Notes | |--------|-----------|------|-------|-------| | Excel |.xlsx| ✅ | ✅ | Multi-sheet support, pagination | | Word |.docx| ✅ | ✅ | Paragraphs and tables | | PDF |.pdf| ✅ | ❌ | Text extraction with pagination | | PowerPoint |.pptx| ✅ | ❌ | Slide content extraction | | CSV |.csv| ✅ | ✅ | - | | Text |.txt, .md| ✅ | ✅ | Pagination support | | JSON |.json| ✅ | ✅ | - | | YAML |.yaml, .yml | ✅ | ✅ | - |

`Tools`

`$3`

Read document content with automatic format detection.

Parameters: -file_path(string, required): Path to the document -mode (string, optional): "paginated" or "raw" (default: "paginated") -page(number, optional): Page number for paginated mode (default: 1) -page_size(number, optional): Items per page (default: 100) -sheet_name (string, optional): Sheet name for Excel files

Example:`json { "file_path": "/path/to/document.xlsx", "mode": "paginated", "page": 1, "page_size": 50, "sheet_name": "Sheet1" }`

`$3`

Write document content.

Parameters: -file_path(string, required): Output path -format (string, required): "excel", "word", "csv", "txt", "json", "yaml"-data (array/object, required): Document content

Example:`json { "file_path": "/path/to/output.xlsx", "format": "excel", "data": [ ["Product", "Q1", "Q2"], ["Laptop", 100, 150], ["Mouse", 500, 600] ] }`

`$3`

Get document metadata without reading full content.

Parameters: -file_path (string, required): Path to the document

Example:`json { "file_path": "/path/to/document.pdf" }`

`$3`

Execute Python code for flexible file operations, data processing, and custom tasks. Supports any file format and Python libraries.

Parameters: -code(string, required): Python code to execute -packages(object, optional): Package mappings (import_name -> pypi_name) for required dependencies -file_paths (array, optional): File paths that the code needs to access

Examples:

Read and process any file:`json { "code": "import json\nwith open('/path/to/file.json') as f:\n data = json.load(f)\n result = len(data)\n print(json.dumps({'count': result}))", "file_paths": ["/path/to/file.json"] }`

Batch rename files with regex:`json { "code": "import os, re\nfolder = '/path/to/files'\nfor name in os.listdir(folder):\n new_name = re.sub(r'old_', 'new_', name)\n os.rename(os.path.join(folder, name), os.path.join(folder, new_name))\nprint(json.dumps({'success': True}))", "file_paths": ["/path/to/files"] }`

Process data with pandas:`json { "code": "import pandas as pd\ndf = pd.read_csv('/path/to/data.csv')\nsummary = df.describe().to_dict()\nprint(json.dumps(summary))", "packages": {"pandas": "pandas"}, "file_paths": ["/path/to/data.csv"] }`

Extract archive files:`json { "code": "import zipfile, os\nwith zipfile.ZipFile('/path/to/archive.zip', 'r') as z:\n z.extractall('/path/to/output')\nfiles = os.listdir('/path/to/output')\nprint(json.dumps({'extracted_files': files}))", "file_paths": ["/path/to/archive.zip", "/path/to/output"] }`

`MCP App`

The built-in MCP App provides a beautiful, interactive interface for viewing documents:

- Excel: Interactive tables with sticky headers - PDF: Page-by-page text viewing - Word: Paragraph and table rendering - PowerPoint: Slide navigation

Built with React 19, Tailwind CSS v4, and Lucide icons.

`Configuration`

Environment variables for customizing behavior:

| Variable | Description | Default | |----------|-------------|---------| |DOC_RAW_FULL_READ | Enable full raw read mode | false| |DOC_PAGE_SIZE | Default items per page | 100| |DOC_MAX_FILE_SIZE | Max file size in MB | 50` |

Contributing

See CONTRIBUTING.md for development setup and contribution guidelines.

License

MIT

docsmith-mcp

![npm version](https://www.npmjs.com/package/docsmith-mcp)

Python-powered document processing MCP with MCP Apps — Process Excel, Word, PDF, PowerPoint documents with ease using Python, and view them beautifully through an interactive MCP App.

Features

Quick Start

$3

Add to your MCP client configuration (e.g., Claude Desktop, Cline, etc.):

Via npx (recommended):
``json { "mcpServers": { "docsmith": { "command": "npx", "args": ["-y", "docsmith-mcp"], "env": { "DOC_PAGE_SIZE": "100" } } } }`

Via global installation:`bash npm install -g docsmith-mcp`

`json { "mcpServers": { "docsmith": { "command": "docsmith-mcp", "env": { "DOC_PAGE_SIZE": "100" } } } }`

Via local path:`json { "mcpServers": { "docsmith": { "command": "node", "args": ["/path/to/docsmith-mcp/dist/index.js"] } } }`

Then use the read_document tool:

`json { "file_path": "/path/to/document.xlsx", "mode": "paginated", "page": 1, "page_size": 50 }`

The MCP App will automatically open to display the document content beautifully.

`Supported Formats`

`Tools`

`$3`

Read document content with automatic format detection.

Example:`json { "file_path": "/path/to/document.xlsx", "mode": "paginated", "page": 1, "page_size": 50, "sheet_name": "Sheet1" }`

`$3`

Write document content.

Parameters: -file_path(string, required): Output path -format (string, required): "excel", "word", "csv", "txt", "json", "yaml"-data (array/object, required): Document content

Example:`json { "file_path": "/path/to/output.xlsx", "format": "excel", "data": [ ["Product", "Q1", "Q2"], ["Laptop", 100, 150], ["Mouse", 500, 600] ] }`

`$3`

Get document metadata without reading full content.

Parameters: -file_path (string, required): Path to the document

Example:`json { "file_path": "/path/to/document.pdf" }`

`$3`

Execute Python code for flexible file operations, data processing, and custom tasks. Supports any file format and Python libraries.

Examples:

`MCP App`

The built-in MCP App provides a beautiful, interactive interface for viewing documents:

- Excel: Interactive tables with sticky headers - PDF: Page-by-page text viewing - Word: Paragraph and table rendering - PowerPoint: Slide navigation

Built with React 19, Tailwind CSS v4, and Lucide icons.

`Configuration`

Environment variables for customizing behavior:

Contributing

See CONTRIBUTING.md for development setup and contribution guidelines.

License

MIT

docsmith-mcp

Dist Tags

docsmith-mcp

Features

Quick Start

$3

`Supported Formats`

`Tools`

`$3`

`$3`

`$3`

`$3`

`MCP App`

`Configuration`

Contributing

License

docsmith-mcp

Dist Tags

docsmith-mcp

Features

Quick Start

$3

`Supported Formats`

`Tools`

`$3`

`$3`

`$3`

`$3`

`MCP App`

`Configuration`

Contributing

License