Introducing Doctly’s Extractor Studio: A Faster Way to Build PDF Extractors


Ali Basiri

Extracting structured data from PDFs has always been one of the messiest parts of automation. Every document looks slightly different. Tables don’t line up. Dates are written in five different ways. You end up juggling regexes, brittle scripts, and countless re-runs before something finally works.

At Doctly, we’ve been building tools that make working with PDFs less painful. Today, we’re excited to launch Extractor Studio, a new way to design, refine, and publish PDF extractors powered by AI. It’s built for developers, data teams, and product builders who are tired of wasting days on document parsing.

Invoice Extractor built with Doctly’s Extractor Studio

How Extractor Studio Works

Upload sample PDFs
Start by dropping in a handful of PDFs. They don’t have to be identical — Extractor Studio thrives on variations. The more examples you give it, the better it understands the quirks of your documents.

Generate JSON output with AI
Once your samples are in, ask the AI to create a JSON schema or CSV output for your data. You can use the chat to request a schema and make changes to it, or start with “I’m Feeling Lucky” and then make adjustments through the chat.
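To make the idea of a schema concrete, here is a minimal sketch of what a JSON schema for an invoice extractor might look like, along with a small check that an extracted record fits it. The field names (`invoice_number`, `invoice_date`, `total`, `line_items`) and the `check_record` helper are illustrative assumptions, not the schema or tooling that Extractor Studio actually generates.

```python
import json

# Hypothetical schema for an invoice extractor. The field names are
# illustrative assumptions, not output from Extractor Studio.
INVOICE_SCHEMA = {
    "type": "object",
    "required": ["invoice_number", "invoice_date", "total"],
    "properties": {
        "invoice_number": {"type": "string"},
        "invoice_date": {"type": "string"},
        "total": {"type": "number"},
        "line_items": {"type": "array"},
    },
}

# Map the JSON type names used above to Python types.
_JSON_TYPES = {
    "string": str,
    "number": (int, float),
    "array": list,
    "object": dict,
}


def check_record(record, schema):
    """Return a list of problems; an empty list means the record fits."""
    problems = []
    for key in schema.get("required", []):
        if key not in record:
            problems.append(f"missing required field: {key}")
    for key, spec in schema.get("properties", {}).items():
        if key in record:
            value = record[key]
            expected = _JSON_TYPES[spec["type"]]
            # Python treats bool as an int, so reject booleans explicitly.
            if isinstance(value, bool) or not isinstance(value, expected):
                problems.append(f"{key}: expected {spec['type']}")
    return problems


sample = json.loads(
    '{"invoice_number": "INV-001", "invoice_date": "2024-01-31",'
    ' "total": 1250.5, "line_items": []}'
)
print(check_record(sample, INVOICE_SCHEMA))
```

A check like this only covers top-level keys and types. A full JSON Schema validator would be the better choice for nested structures.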

Refine and Compare with Diffs
Building extractors is an iterative process. If you need to make changes after you publish, you can continue your conversation and compare your latest edits against the previous published version with a clean diff view. That way, you always know exactly what changed and why.
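For a rough sense of what comparing a draft against a published version involves, the sketch below diffs two versions of a schema with Python's standard `difflib`. This is not how Extractor Studio's diff view is implemented. The two schema dictionaries and the `schema_diff` helper are made up for illustration.

```python
import difflib
import json


def schema_diff(published, draft):
    """Return unified-diff lines between two schema versions."""
    # Serialize with sorted keys so that key order alone never shows up
    # as a change.
    old_lines = json.dumps(published, indent=2, sort_keys=True).splitlines()
    new_lines = json.dumps(draft, indent=2, sort_keys=True).splitlines()
    return list(
        difflib.unified_diff(
            old_lines, new_lines,
            fromfile="published", tofile="draft", lineterm="",
        )
    )


# Hypothetical schema versions: the draft adds a "currency" field.
published = {"invoice_number": "string", "total": "number"}
draft = {"invoice_number": "string", "total": "number", "currency": "string"}

for line in schema_diff(published, draft):
    print(line)
```

Lines starting with `+` are additions in the draft and lines starting with `-` are removals. Identical versions produce an empty diff.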

Version control built in
No more wondering which extractor is in production. Every version is saved, tagged, and ready to roll back if you need to.

Publish to an API endpoint
When you’re happy with the extractor, publishing is just one click. Instantly, it’s available at an API endpoint you can call from your applications.
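As a sketch of what calling a published extractor from an application could look like, the snippet below builds an HTTP POST request with Python's standard library. Everything specific here is an assumption for illustration: the endpoint URL, the bearer-token header, and sending the raw PDF bytes as the body are placeholders, not Doctly's documented API. Use the endpoint and authentication details shown after you publish.

```python
import urllib.request


def build_extract_request(endpoint_url, api_key, pdf_bytes):
    """Build (but do not send) a POST request carrying a PDF.

    The auth scheme and content type are assumptions; check the
    published extractor's documentation for the real contract.
    """
    return urllib.request.Request(
        endpoint_url,
        data=pdf_bytes,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/pdf",
        },
    )


# Placeholder URL and key, not real Doctly values.
req = build_extract_request(
    "https://api.example.com/extractors/invoice-v1",
    "YOUR_API_KEY",
    b"%PDF-1.7 placeholder bytes",
)
print(req.get_method(), req.full_url)
```

With real values in place, passing the request to `urllib.request.urlopen(req)` would send it. The structured result could then be parsed with `json.loads`, assuming the endpoint returns JSON.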

Custom Extractors With “Ultra”
We can also build extractors for you. If you need extreme accuracy, let us know and we can move your extractor onto our Ultra platform, which gives you the highest possible accuracy and run-to-run consistency.

The Bigger Picture

We believe PDF extraction shouldn’t feel like a dark art. It should feel like a straightforward part of your data pipeline. Extractor Studio is the next step toward making document intelligence accessible and reliable for every team.

Whether you’re parsing invoices, contracts, reports, or entirely custom document types, Extractor Studio gives you a faster, more collaborative, and more transparent way to get structured data.

Sign up now at https://doctly.ai. No credit card required.

If you made it this far, email us at support@doctly.ai and we’ll give you an additional 250 credits to get you started.