Ask HN: Is archive.org a good place for structured data?

3 points by tfederman 8 months ago · 2 comments · 1 min read

Let's say I put a lot of effort into creating an authoritative data set on a notable topic in a format like CSV, JSON, whatever. And I want to share it and have it housed somewhere more official/permanent than I'll be able to support.

Could archive.org be thought of as a place for raw assembled/authoritative data, with a layer that could transform it into a static web site view to be hosted elsewhere with additional design and features but would not be the authoritative source of the underlying data?

Is there a better place for this sort of vision?

PaulHoule 8 months ago

What about Kaggle? Or GitHub?

tfedermanOP 8 months ago

It's not a big data set that lends itself primarily to analysis, it's more like content. For example, a list of all US Presidents with a lot of metadata or text content fields about them collected/combined from different sources, cleaned, corrected, annotated, etc. (Pretend Wikipedia has only a subset of these fields and considers broadening them out of scope.)
As for Github, the data would still be under "my" account and I'm thinking about more of a platform that doesn't depend on one person. Maybe I would manage day to day version control in Github but I'd want to promote occasional releases to be more official and not reliant on my account.

Settings

Ask HN: Is archive.org a good place for structured data?

Keyboard Shortcuts