PDF to Excel

Extract tabular data from PDF files and convert to Excel spreadsheets with customizable parsing options

Convert PDF documents containing tables to Excel format using pure Node.js.

Features:

  • Extracts text content from PDF files
  • Parses text into table rows using customizable delimiters
  • Auto-detects common delimiters (tab, pipe, comma, semicolon)
  • Creates formatted Excel files with headers
  • Supports multiple pages combined into one sheet

Example Results

1 examples

PDF Table to Excel

Extract tabular data from a PDF and convert to Excel format

pdf-to-excel-output.xlsx View File
View input parameters
{ "sourceFile": "/public/samples/pdf/table-data.pdf", "delimiter": "auto", "hasHeader": true }

Click to upload file or drag and drop file here

Maximum file size: 50MB Supported formats: application/pdf

Key Facts

Category
Documents & PDF
Input Types
file, select, checkbox, text
Output Type
file
Sample Coverage
4
API Ready
Yes

Overview

Easily extract tabular data from PDF documents and convert them into structured Excel spreadsheets. This tool parses text content from your PDFs, automatically detects or applies custom column delimiters, and generates formatted Excel files with options to adjust column widths, skip empty rows, and define custom sheet names.

When to Use

  • When you need to extract financial tables or reports from PDF documents into editable Excel spreadsheets.
  • When you have structured text-based PDF tables and want to avoid manual copy-pasting.
  • When you need to consolidate multi-page PDF tables into a single, formatted Excel sheet with custom column widths.

How It Works

  • Upload your PDF file containing the tables you want to extract.
  • Select your column delimiter or use auto-detect, and configure formatting options like headers, empty row skipping, and sheet name.
  • Click convert to parse the PDF text and download a clean, formatted Excel (.xlsx) file.

Use Cases

Converting PDF financial statements into Excel sheets for budgeting and data analysis.
Extracting product inventory lists or price sheets from PDF catalogs into structured spreadsheets.
Migrating tabular research data from academic PDF papers to Excel for statistical processing.

Examples

1. Extracting Quarterly Financial Tables

Financial Analyst
Background
An analyst receives quarterly performance reports in PDF format containing multi-page tables of revenue and expenses.
Problem
Manually copying the financial tables into Excel is slow and prone to formatting errors.
How to Use
Upload the quarterly report PDF, set the delimiter to 'Auto-detect', check 'First Row Contains Headers', and name the sheet 'Q3_Financials'.
Example Config
Delimiter: auto, Has Header: true, Skip Empty Rows: true, Sheet Name: Q3_Financials
Outcome
A clean Excel spreadsheet with auto-adjusted column widths containing all financial data ready for analysis.

2. Converting Pipe-Delimited PDF Reports

Data Operations Specialist
Background
A system export generates log summaries in PDF format where columns are separated by pipe characters.
Problem
Standard PDF converters fail to align the columns correctly, merging data fields together.
How to Use
Upload the log PDF, select the 'Pipe (|)' delimiter option, and enable 'Skip Empty Rows' to clean up the output.
Example Config
Delimiter: |, Has Header: true, Skip Empty Rows: true, Sheet Name: SystemLogs
Outcome
An organized Excel sheet where each pipe-separated value is correctly mapped to its own column.

Try with Samples

xlsx, xls, pdf

Related Hubs

FAQ

Can this tool convert scanned PDF images?

No, this tool extracts text-based tables from digital PDFs and does not support OCR for scanned images.

How does the auto-detect delimiter option work?

It analyzes the spacing and characters in the PDF text to automatically identify if columns are separated by tabs, commas, pipes, or spaces.

Can I combine tables from multiple pages into one Excel sheet?

Yes, the tool automatically parses tables across multiple pages and compiles them into a single Excel worksheet.

What file formats are supported for output?

The tool outputs standard Excel spreadsheets in .xlsx format.

Is there a file size limit for the uploaded PDF?

Yes, the maximum supported file size for PDF uploads is 50 MB.

API Documentation

Request Endpoint

POST /en/api/tools/pdf-to-excel

Request Parameters

Parameter Name Type Required Description
sourceFile file (Upload required) Yes -
delimiter select No -
hasHeader checkbox No -
skipEmptyRows checkbox No -
autoColumnWidth checkbox No -
sheetName text No -

File type parameters need to be uploaded first via POST /upload/pdf-to-excel to get filePath, then pass filePath to the corresponding file field.

Response Format

{
  "filePath": "/public/processing/randomid.ext",
  "fileName": "output.ext",
  "contentType": "application/octet-stream",
  "size": 1024,
  "metadata": {
    "key": "value"
  },
  "error": "Error message (optional)",
  "message": "Notification message (optional)"
}
File: File

AI MCP Documentation

Add this tool to your MCP server configuration:

{
  "mcpServers": {
    "elysiatools-pdf-to-excel": {
      "name": "pdf-to-excel",
      "description": "Extract tabular data from PDF files and convert to Excel spreadsheets with customizable parsing options",
      "baseUrl": "https://elysiatools.com/mcp/sse?toolId=pdf-to-excel",
      "command": "",
      "args": [],
      "env": {},
      "isActive": true,
      "type": "sse"
    }
  }
}

You can chain multiple tools, e.g.: `https://elysiatools.com/mcp/sse?toolId=png-to-webp,jpg-to-webp,gif-to-webp`, max 20 tools.

Supports URL file links or Base64 encoding for file parameters.

If you encounter any issues, please contact us at [email protected]