Settings

Theme

Show HN: Engine – A multi-LLM alternative to Codex

enginelabs.ai

16 points by sdspurrier 7 months ago · 15 comments

Reader

sudb 7 months ago

I worked on this! Happy to answer any questions anyone has.

atlas_mugged 7 months ago

What are the limitations are there in terms of tasks this can handle? How does this compare with the other products out there? There are plenty of options...

  • sdspurrierOP 7 months ago

    Depends on your set of tasks but we use Engine for the bottom ~50% of issues by complexity. We have a pretty good swe-bench score from a while back but it's got much better since!

    We have also focused on workflow integrations so you can assign issues from Linear, Jira, Trello etc which makes it more useful for teams.

    • jackmpcollins 7 months ago

      Seems to me that integrations will be the most important component of tools like this. As an engineer I get my context from video calls with customers and other engineers, slack messages, emails, docs online, using the product myself, etc. So an auto-engineer should do the same.

diminikolaou 7 months ago

This is cool. I can see the anti-monopoly of OpenAI argument, but apart from that is there a strong argument of being multi-LLM for a Codex-like agent?

  • sdspurrierOP 7 months ago

    We often find that some models perform better on certain types of repo. For example Claude 3.5/7 is typically much better at frontends. That's why we let you switch up the model for each repo.

jackmpcollins 7 months ago

I've already merged my first Engine PR! Being able to review PRs like normal and it updates its work is very cool.

julvo 7 months ago

Looks great! What's your experience of using this for working on real world production code?

simvirdi 7 months ago

Looks cool - do you have any benchmarks? How do you compare to other products out there?

  • sudb 7 months ago

    We last submitted a SWE-Bench verified result in November 2024 - at the time I believe we were in the top 5 entrants.

    We expect Engine to be as good as the other code-writing agents out there at the moment - we understand almost everyone in the space to be using very similar base models and agent scaffolding.

RHSman2 7 months ago

Demo’s this 6 months ago. Super excited to see how far it has come since!!!

ca508 7 months ago

been following engine from afar for a while, super cool to see it on HN. didn't see it had a free plan, will try it out.

ph94robotics 7 months ago

Boom been waiting for something like this!

Keyboard Shortcuts

j
Next item
k
Previous item
o / Enter
Open selected item
?
Show this help
Esc
Close modal / clear selection