Using Llama3.1 405B to generate political synthetic data
oxen.aiWe thought it'd be interesting to see what political biases Llama 3.1 405B has by generating a bunch of "spam" or "ham" messages with it. We started with 5 hand crafted messages and let the LLM take it from there ending up with over 1k.
Full process was documented here:
https://www.oxen.ai/blog/create-your-own-synthetic-data-with...
Next up we are going to train a classifier on the outputs, as well as do some classical NLP (named entities, keywords, sentiment, etc) on it to see what we find.
Mainly a fun side project, but could have some interesting implications assuming candidates are using LLMs in the upcoming elections.
This has the legitimate possibility to be the most telling spam or ham (and funniest) yet. Very interested in the state by state breakdowns.
Is anyone using Oxen.ai? This is the first use of it I've seen.