Ask HN: Is the Llama 3.1 model truly open source?
Discussed at length on HN 3 days ago
Thank you. I didn't know that.
Is the Llama 3.1 model open source? Yes.
Really? No.
Training Data = Source Code
This time, the Llama 3.1 model was released as open source. However, the training data has not been disclosed. In AI and deep learning, training data is the "source code," while the model is like an app. Therefore, Llama is not truly open source. It's more like a free language model app being released.
When Meta's Zuckerberg released Llama, he mentioned Linux and posted, "Open Source AI is the Path Forward."
While Zuckerberg looks cool and we appreciate him, we should still understand that Llama is not open source. To be open source, the training data must be disclosed. We should not be under the illusion that Llama is open source. (Training data size: approximately 15 trillion tokens)
My understanding of HN is that the parent comment should be part of the question text.
Also, you shouldn't make an Ask HN post unless you're actually asking a question. Here OP seems to just want to share their own ideas on the matter.
That, or they forgot to switch accounts and they're trying to karma farm :P
I wanted to argue that Llama 3.1 is not truly open source, and I wanted to discuss this with the public. Looking at previous "Ask HN" posts, it appeared to be a board where both questions and discussions were possible.
Thank you.