Privacy
Your PDFs never leave your machine. Here's why you can trust that — and how to verify it.
The Local-Processing Guarantee
When you use Docuoria, your PDF files never leave your machine. The engine reads, extracts, transforms, and renders entirely in-process. No cloud upload. No third-party access. Your documents stay yours.
How We Can Make That Claim
Four technical properties of the engine that back the guarantee.
PDFs are processed as streams, not URLs
The engine API accepts a Stream parameter. Your PDF bytes are never serialised to a URL or transmitted over the network as part of the extraction process.
No PDF bytes travel over the wire
The only data that crosses a network boundary is the template JSON — not the document content. Your PDFs are read in-process and discarded when the pipeline completes.
The API never accepts PDFs
The service layer accepts structured requests only — PDF content is never transmitted to Docuoria. Your document stays on your machine throughout the entire extraction.
Verifiable behaviour
You can inspect network traffic with a proxy tool during an extraction run and verify that no PDF content leaves the machine. The engine is deterministic — same inputs produce the same outputs, every time.
Limitations & Caveats
Three scenarios where data may travel outside your machine — know about them before deploying.
Third-party loggers
If you integrate a logging or observability provider into your application, log statements may capture extracted field values. Review your logging configuration if extracted data is sensitive.
HTTP Template Store
If your templates are loaded from a remote HTTP endpoint, the template JSON (not the PDF content) is fetched over the network. Use a local template source if this is a concern.
HttpRetrievalProvider
If your template uses an HttpRetrievalProvider to enrich extracted data with external lookups, those HTTP requests are made from your machine. Review the template definition for any external endpoints.
Ready to get started?
Docuoria is free to use. Install in minutes — no cloud account, no API key, no data sharing required.