Settings

Theme

Ask HN: Is archive.org a good place for structured data?

3 points by tfederman 5 months ago · 2 comments · 1 min read


Let's say I put a lot of effort into creating an authoritative data set on a notable topic in a format like CSV, JSON, whatever. And I want to share it and have it housed somewhere more official/permanent than I'll be able to support.

Could archive.org be thought of as a place for raw assembled/authoritative data, with a layer that could transform it into a static web site view to be hosted elsewhere with additional design and features but would not be the authoritative source of the underlying data?

Is there a better place for this sort of vision?

PaulHoule 5 months ago

What about Kaggle? Or GitHub?

  • tfedermanOP 5 months ago

    It's not a big data set that lends itself primarily to analysis, it's more like content. For example, a list of all US Presidents with a lot of metadata or text content fields about them collected/combined from different sources, cleaned, corrected, annotated, etc. (Pretend Wikipedia has only a subset of these fields and considers broadening them out of scope.)

    As for Github, the data would still be under "my" account and I'm thinking about more of a platform that doesn't depend on one person. Maybe I would manage day to day version control in Github but I'd want to promote occasional releases to be more official and not reliant on my account.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection