Settings

Theme

Show HN: LlamaExtract, a tool to automatically extract schema from documents

github.com

8 points by pierre a year ago · 4 comments · 1 min read

Reader

We build LlamaExtract, a tool that allow you to automatically extract a data model from a collection of documents, and then reusing this datamodels (JSON Schema) to extract data from documents.

Available as a Python library and as an API.

Announcement blog: https://www.llamaindex.ai/blog/introducing-llamaextract-beta...

verdverm a year ago

When you say "available as a python library", do you mean the LlamaCloud API wrapped in a Python package? This doesn't seem to be something that we can use without a LlamaCloud account. When I hear that, I think more that this is something I could run locally.

Is there any documentation or a paper on the methods?

Is this intended to be a proprietary service?

cheesyFishes a year ago

Pretty neat initial launch. What's top of mind to add to it?

BinaryBrain a year ago

Does it handles multiple input documents for extraction?

  • pierreOP a year ago

    yes, you can pass an array of path to the extract function.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection