Settings

Theme

Structured Generation Improves LLM Performance: GSM8K Benchmark

blog.dottxt.co

11 points by Homunculiheaded 2 years ago · 5 comments

Reader

curionav 2 years ago

Intuitively, regex or json grammar have a much lower "semantic dimension" than what today LLMs allow. Maybe the observed performance gains result from such lower dimensionality.

remilouf 2 years ago

That whole structured generation line of work looks promising. I hope someone else takes this and runs evaluations on other benchmarks. Curious to see if the results translate!

  • HomunculiheadedOP 2 years ago

    Agreed! While these results are very promising, there's still a lot to explore in this space.

    In addition to the "prompt consistency" and "thought-control" ideas mentioned in the post, I'm definitely curious how the performance is on more complex structured data (things like codegen).

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection