Settings

Theme

3T Token Open Corpus for Language Model Pretraining

blog.allenai.org

35 points by thatcherthorn 2 years ago · 5 comments

Reader

version_five 2 years ago

This has some kind of stupid custom license with it that from what I can tell only lets you use models you train on it for "internal use" (or tries to, it's fair use so whatever) . It's getting really shitty to see everyone trying to control how people can use their "contributions" - if it was for commercial reasons I'd understand but it's all this silly "AI harms" garbage. Treat collaborators like adults and let them decide how they want to use ostensibly public domain stuff.

  • two_in_one 2 years ago

    I doubt they can enforce their license on derivative work while ignoring the license of the source.

zwaps 2 years ago

According to the license, AllenAI can just take over (all) the rights and ownership for any derivative works by revoking your usage license, which they can also do at will.

Reasonably speaking, nobody can use this dataset for anything of value. I really wonder who comes up with these "open-source" products with such licenses and why they even bother. I guess Marketing?

sunshadow 2 years ago

Unfortunately the license makes this somewhat useless. Hope they realize that and change it.

ttt3ts 2 years ago

Dumb license. Useless.

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection