Document Parsing

Document Parsing uses state-of-the-art research techniques to extract and visualize structured data from a wide range of document types, including but not limited to PDFs, images, Word documents (DOC/DOCX), and PowerPoint files (PPT/PPTX). It's designed to handle complex layouts such as tables, charts, and mixed text-image content.

Document Parsing is built on the ai_parse_document function and includes a UI that allows you to parse documents and immediately inspect their structure through formatted text or structured JSON outputs.

Requirements

A workspace that includes the following:
- Serverless compute enabled. See Serverless compute requirements.
- Unity Catalog enabled. See Enable a workspace for Unity Catalog.
- Access to a serverless usage policy with a nonzero budget.
This function is only available in some regions, see AI function availability.
- The ai_parse_document function is also available to workspaces with the Enhanced Security and Compliance add-on.

Parse documents

Use Document Parsing to parse your documents and visualize their structure.

Go to Agents in the left navigation pane of your workspace.
Click Create Agent > Document Parsing.
Select your source document. You can choose to upload a file or select one from an existing Unity Catalog catalog. Supported formats include: PDF, images, DOC/DOCX, and PPT/PPTX.
Click Parse document.

Parsing your document can take a few minutes. When complete, Document Parsing shows the source document on the left and the parsed document on the right. You can choose to view the parsed document as Formatted text or Raw JSON.

Document parsing UI showing source and parsed document side by side

Process and query results

To view the ai_parse_document query and run it on more documents, click Use Agent and choose either to run the query from the SQL Editor or Notebook. You can edit the query to point to the volume or table your documents live in.

Document Parsing provides a UI interface to the SQL function ai_parse_document. See the ai_parse_document reference page for more advanced examples and details.

To prepare the parsed output for retrieval (RAG), use ai_prep_search (Beta) downstream.

Limitations

See ai_parse_document limitations.

Requirements​

Parse documents​

Process and query results​

Limitations​

Requirements

Parse documents

Process and query results

Limitations