EmbedPDF

Layout Analysis

Client-side document structure detection running entirely in your browser. Uses ONNX Runtime (WebAssembly/WebGPU) to run deep learning models locally -- no server, no API keys, your documents never leave the tab.

PP-DocLayoutV3

Apache 2.0

Layout detection -- identifies text blocks, titles, tables, figures, headers, footers, and other structural elements on each page.

PaddleOCR / PaddlePaddle

table-transformer-structure-recognition

MIT

Table structure recognition -- detects rows, columns, column headers, and spanning cells within tables identified by the layout model.

microsoft/table-transformer

Experimental -- will use significant resources

This runs deep learning inference directly in your browser tab. The models are ~60 MB each and inference is CPU/GPU intensive. Expect high memory usage and your tab may become unresponsive during analysis, especially on mobile devices or older hardware.

Loading PDF engine (PDFium via WebAssembly)...

Use it in your project

The layout analysis plugin works with React, Svelte, and Vue. Everything above is built with @embedpdf/plugin-layout-analysis and runs on the headless @embedpdf/core plugin system.