Extract data from any document layout

extractio.web.app

2 points by warrendlee1 3 years ago · 5 comments

dnl01 3 years ago

What is the difference between this and something like Docparser?

  • warrendlee1OP 3 years ago

    Docparser and most OCR services require the document layout to be the same or similar for each template you make.

    Extractio doesn't require the user to make any templates and can be used for any generic document layout.

    A perfect use case for Extractio would be resumes: candidates submit many different resume layouts, but it can parse out the name, positions, email, etc. without a template for each layout.

    • andrewio 3 years ago

      AI-powered PDF parsers in general don't require documents to use "the same layout".

      I'm building Parsio (parsio.io), which can parse invoices, receipts, etc. They may all have different layouts :)

      • dnl01 3 years ago

        Great tool! Why do you think tools like Docparser exist when Parsio seems to eliminate all the work a user would do to build templates? Seems like a no-brainer.

        What have you seen as the most common use case for something like Parsio vs Docparser?

        • andrewio 3 years ago

          Docparser has been on the market for years and has a solid customer base, despite the rise of more sophisticated AI-powered tools for data extraction. The switching cost for customers is relatively high in terms of the effort required.

          As for the use cases, we see all kinds.

          For emails, our customers parse submitted forms and leads into Google Sheets and CRMs, extract booking data from Airbnb confirmations, export Etsy orders to Trello, and extract and filter HARO queries.

          For PDFs and scanned documents, we have businesses parsing invoices, receipts, quotes, contracts, business cards etc.
