Gradio

Doctra — Document Parser

Parse PDFs, extract tables/charts, preview markdown, and download outputs.

PDF

Use VLM (optional)

VLM Provider

VLM API Key

Status

Page image

Download individual output files

Download all outputs (ZIP)

Tips

On Spaces, set a secret VLM_API_KEY to enable VLM features.
Use Enhanced Parser for documents that need image restoration before parsing (scanned docs, low-quality PDFs).
Use DocRes Image Restoration for standalone image enhancement without parsing.
DocRes tasks: appearance (default), dewarping, deshadowing, deblurring, binarization, end2end.
Outputs are saved under outputs/<pdf_stem>/.
Note: Google Gemini VLM may not be available due to dependency conflicts. Use OpenAI, Anthropic, or other VLM providers.