Ask HN: How does my usage of LLMs train the underlying models?

1 point by emehex 7 months ago · 1 comment · 1 min read

I understand that using tools like Chat, Cursor, and Claude Code for software development is likely providing training data to help these LLMs get better at coding (the irony isn't lost on me that I might be contributing to making myself obsolete...)

But I'm curious about the actual mechanics: How exactly does this feedback loop work? When I accept, reject, or modify the code that these models spit out, is that signal fed directly back into training?

Not necessarily against this, just genuinely curious about how the sausage is made.

incomingpain 7 months ago

>How exactly does this feedback loop work? When I accept, reject, or modify the code that these models spit out, is that signal fed directly back into training?

My understanding is that essentially nobody uses our inputs as training data anymore. It made the models worse when they did.

Garbage data in, garbage out.

They want to control and use only quality data as their training data.
